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BROADBAND MID-NETWORK SERVER 

The present invention relates to internetworked 
communication systems^ and especially (but not 
exclusively) to a highly scalable, broadband mid-network, 
server for performing mid-network processing functions 

5 including routing functions, per user processing, 

encryption, bandwidth distribution and traffic shaping. 
Background and Summary of the Invention 

As bandwidths within the core of the Internet increase, 
there is an increasing trend towards using the Internet 

10 Protocol ("IP") as the core network layer protocol for all 
kinds of traffic, including voice, video and data. 

Historically, quality of service on the Internet has been 
what is called "best effort," That is, the network attempts 
to transport as much traffic as possible, but if there is 

15 insufficient capacity to handle the traffic, all connections 
are equally likely to be influenced by congestion. Thus, 
"best effort" implies that the Internet provides only one 
class of service to any connection, and that all connections 
are handled equally with no priority. 
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In the case of traditional Internet applications, this 
approach was often sufficient. However, the intrinsic 
potential of the Internet is considerably greater, and 
includes new multimedia and interactive applications. Voice 
5 over IP ("VoIP") and Real Time Video are envisioned to be two 
significant applications for propelling Internet growth to the 
next level. VoIP can be defined as the ability to make 
telephone calls and send faxes over IP networks. The benefits 
of this teclinology are cost reduction, simplification, 

10 consolidation and advanced applications such as shared screens 
or whiteboarding which combine voice and data. Real Time 
Video is a 'Mirect-to-user" technique in which a video signal 
is transmitted to the client device and presentation of the 
video begins after a short delay for data buffering, and 

15 eliminates the need for significant client-site storage 
capacity. It is also expected to become popular with 
businesses. Related to this is webconf erencing, which 
requires high bandwidth since it is a continuous transfer of 
image information together with voice transfer. 

20 Webconferencing also requires real time traffic handling 
because it is usually implemented as an interactive 
application. 

All of these new applications will generally require 
significant bandwidth and/or reduced latencies. Bandwidth is 

25 the critical factor when large amounts of information must be 
transferred within a reasonable time period. Latency is the 
minimum time elapsed between requesting and receiving data and 
is important* in real-time and interactive applications such as 
webconferencing and telecommuting. Presently, most 

30 telecommuters depend upon analog modems with limited bandwidth 
and significant latency for dial-up connectivity. Even for 
today's applications, dialup connectivity is often inadequate. 

There are competing "last mile" technologies today which 
provide transport services to the user for delivering packets 

35 to the "edge" of the Internet. To con^lete the communication, 
the packets need to be formatted to allow them to enter the 
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Internet cloud and find their way to their respective 
destinations. The emergence of supporting protocols for new 
applications and the growth spurt in jiumber of users and the 
required bandwidth per user results in a very dynamic access 
5 environment - 

The following is a summary of observations that pertain 
to an ideal mid-network point within the Internet: 

• In order to accoinmodate a variety of source packets, 
all the requisite protocols must be efficiently 

10 supported. 

• virtual Private Network services allow a private 
network to be configured within a ptGDlic network. 
This is one of the drivers for Internet access amongst 
businesses. To allow Virtual Private Networks to 

15 coexist on the pxiblic Internet, and to encourage 

business use of the Internet, great care must be taken 
with respect to security and authentication issues, 
and tunneling protocols such as L2TP and IPSec must be 
efficiently supported . 

20 • The number of subscribers handled by one system and 

the different qualities of service provided will make 
service provider administration more complex. To make 
provisioning of broadband access more attractive to 
service providers, subscriber management and usage 

25 accounting must be sin^lified, and differentiated 

services must be provided. 

• Broadband makes it possible to provide different 
amounts of bandwidth to users and to smaller Internet 
Service Providers. To make wholesaling of IP 

30 connectivity possible, and to simplify service and 

repair functions, the ability to support multiple 
service providers with one mid-network server must be 
provided. 

• A large number of connections are serviced with a 
35 broadband mid-network server. In order to ensure that 

service is not interrupted, the broadband server must 
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have very high availability. Such' availability is 
also required for niission-critical business 
applications. 

• Central office co-location space is limited. To 

5 conserve this space, large connection densities must 

be provided. 

• When sxibscribers are allowed access at high speeds, it 
is possible for a limited number of users demanding 
disproportionate amounts of bandwidth to disrupt 

10 service for other customers. To ensure that large 

traffic bursts do not overload small client buffers, 
and to ensure that service providers and customers are 
treated fairly, traffic shaping must be provided. 

• To enable new value-added services, large bandwidths 
15 and low latencies are critical. 

In order to solve these and other needs in the art, the 
inventors hereof have succeeded at designing and developing a 
broadband mid-network server that, in the most preferred 
embodiment, satisfies all of the requirements described above. 

20 This inventive server provides reliable, secure, fast, 

flexible, high-bandwidth, and easily managed access to the 
Internet so as to accommodate all current Internet services 
including email, file transfer, web surfing and e-coinmerce, as 
well as the new value added services such as VoIP and Real 

25 Time Video. To meet these requirements, the broadband mid- 
network server of the present invention has been designed to 
scale not only in bandwidth, but also in processing power and 
state space. In the preferred embodiment, the architecture 
allows a service provider to configure the cards chosen for 

30 use in the available chassis space to suit his particular 
application. For example, to maximize processing power, a 
service provider could increase the number of IPE cards at the 
expense of a fewer number of line cards; as few as one line 
card. In the case of one line card, the maximum amount of 

35 processing power would be available to a service provider. IN 
the preferred embodiment described in detail below, this 
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configuration would provide 240 processors and 39 gigabytes of 
memory. This would allow for a greater number and complexity 
of value added services which require more processing power. 
Alternatively, a greater number of line cards could be 
5 selected for use in a chassis which would be desirable to 

handle greater traffic and throughput at the expense of fewer 
value added services* 

The high bandwidth core routers that are currently under 
development by third parties are optimized for performing 

10 large numbers of fast routing lookups, but are not expected to 
provide generalized and flexible computing power for 
supporting the substantial amount of processing needed for, 
among other things, per user and per packet processing. In 
contrast, the broadband mid-network server of the present 

15 invention includes the ability to distribute traffic across a 
number of Internet processing engines and, more specifically, 
across a number of protocol processing units provided in each 
engine (the bandwidth to which can be coordinated) , to provide 
compute power and state space required for performing per user 

20 processing for a large number of users. 

One important feature of the present invention is a 
unique architectural philosophy, which provides that 
processing, be performed as close to the physical layer as 
warranted by considerations of flexibility, cost and 

25 complexity. This architectural philosophy maintains 
balance between two kinds of processing which are 
important to scaling bandwidth with value-added services 
in broadband networks; time-consuming, repetitive 
processing; and flexible processing which must be easy to 

30 program by third parties. The need for considerable 

time-consuming repetitive processing, which has proved to 
create a bottleneck in the processor-based servers of the 
prior art, is addressed by the inventive architecture 
through specialized hardware, and results in dramatic 

35 increases in speed and decreases in delay. The need for 
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flexible, easy to use, computing power to enable service 
providers to scale with value-added services is addressed 
by the inventive architecture preferably through the 
provision of high-performance general purpose processors 
5 which are paralleled and which can be scaled to a 
virtually limitless degree. Alternatively, network 
processors or digital signal processors or any other 
programmable processor could be utilized as well. 
Accordingly, the broadband mid-network server of the 

10 present invention provides a system that is currently 

unrivalled in performance and which can become the prime 
mover of Internet services such as managed, secure VPNs, 
Voice over IP and Real Time Video. 

While some of the principal features and advantages 

15 of the present invention have been described above, a 
greater and more thorough appreciation of the invention 
may be attained by referring to the drawings and the 
detailed description of the preferred embodiments which 
follow - 

20 Brief Description of the Drawings 

Fig. 1 illustrates a single shelf broadband mid- 
network server according to one embodiment of the present 
invention; 

Fig. 2 is a functional block diagram of the 
25 preferred server shown in Fig. 1; 

Fig. 3 is a functional block diagram of an exemplary 
line card shown in Figs. 1 and 2; 

Fig. 4 is a functional block diagram of an exemplary 
IPE card shown in Figs. 1 and 2; 
30 Fig. 5 illustrates routed distribution to an IPE 

card; 

Fig. 6 illustrates the processing flow on an IPE 

card; 
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Fig. 7 illustrates a protocol processing platform 

according to the present inventions- 
Fig, 8 is a functional block diagram of an exemplary 

buffer access controller; 
5 Fig. 9 illustrates the format of a cell received at 

an input to a BAG from a PIC; 

Fig. 10 is a functional block diagram of a preferred 

packet manager; 

Fig. 11 is an illustration of the deployment of a 
10 broad-band mid-network server at a Service Provider POP; 

Fig. 12 is an illustraion of the different kinds of 

links an ISP may want on a secure segment; 

Fig. 13 is an illustration of the system wide 

bandwidth distribution functions; 
15 Fig. 14 is an illustration of the multi-level 

policing and multi-level shaping that occurs in the 

system; 

Fig. 15 is an illustration of router distribution, 
two level policing, routing and two level shaping; 
20 Fig. 16 is a functional block diagram of a preferred 

packet inspector; 

Fig. 17 is an illustration of the preferred 
Distributor Flow Unit; and 

Fig. 18 is a summary of the highlights of the DFU. 
25 Detailed Description of the Preferred Eanbodiments 

The mid-network processor of the present invention 
is preferably iitplemented in a single shelf system as 
shown generally in Fig. 1, and is indicated generally by 
reference character 300. As shown in Fig. 1, the mid- 
30 network processor 300 is provided with a number of 

physical connection ("PHY") cards 302-316 through which 
packets may enter and exit the mid-network processor 300 
according to a particular communication protocol, as is 
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known in the art. For the preferred embodiment 
illustrated in Fig. 1, the mid-network processor 300 
supports the POS, ATM, and Gigabit Ethernet layer two 
protocols, although the mid-network processor may readily 
5 be configured to support additional protocols, as will be 
apparent. The PHY cards 302-316 are each associated with 
line cards 322-336, respectively, as shown in Fig. 1. As 
is well known in the art, each PHY card is media 
specific. In other words, each PHY card is provided with 

10 connectors and other components necessary to interface 
with the communication media connected thereto, and over 
which packets enter and exit the PHY card. Each line 
card is configured to process packets of the type 
received from its associated PHY card, as explained more 

15 fully below. 

The preferred mid-network processor 300 shown in 
Fig. 1 is also provided with a number of Internet 
Processing Engine ("IPE") cards 340-354, as well as two 
flash memory modules 360, 362 and four switch fabric 

20 modules 364-368. As appreciated by those skilled in the 
art, the. number of switch fabric cards required is a 
function of the switch fabric card design as well as the 
desired redundancy overall performance. Fig. 1 also 
illustrates a midplane 370 that is provided for 

25 interconnecting the various cards described above. The 
preferred mid-network processor 300 utilizes a card-based 
approach to facilitate maintenance and expansion of the 
mid-network processor 300, as necessary, but this is 
clearly not a limitation of the present invention. 

30 The manner in which packets are processed by the 

preferred mid-network processor 300 will now be described 
with reference to Fig. 2, which is a functional block 
diagram of the preferred mid-network processor 300 shown 
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in Fig. 1 (although, to simplify the illustration. Fig. 2 
does not show the PHY cards 310-316, the line cards 330- 
336 and the IPE cards 346-354 shown in Fig. 1). Packets 
enter the mid-network processor 300 via the PHY cards, as 
5 is known in the art. Each PHY card then delivers its 
packets to its associated line card through the midplane 
370. After performing initial processing of the packet, 
the line card delivers the packet again through the 
midplane to the switch fabric which, in turn, delivers 

10 the packet to one of the IPE cards for performing certain 
mid-network processing functions, such as routing 
functions, per user processing, encryption, and bandwidth 
distribution. After performing mid-network processing 
for the packet delivered thereto, the IPE card sends the 

15 packet back into the switch fabric, typically for 

delivery to one of the line cards for some additional 
processing before allowing the packet to exit the mid- 
network processor 300 through one of the PHY cards. In 
some cases, depending upon how the mid-network processor 

20 of the present invention is implemented, a single IPE 
card may be insufficient to complete the necessary mid- 
network processing functions for a packet delivered 
thereto. In this case, upon performing some processing, 
the IPE card will deliver the packet to another IPE card 

25 (rather than to one of the line cards) via the switch 
fabric for further processing. Thus, although a packet 
will typically be processed by only one IPE card, it is 
possible to process a packet in multiple IPE cards, if 
necessary. 

30 In this preferred embodiment, all of the line cards 

contain identical hardware, but are independently 
programmable. Likewise, all of the IPE cards contain 
identical hardware, but are independently programmable. 
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This contributes to the scalability and elegantly simple 
design of the preferred mid-network processor 300. 
Additional processing power can be provided to the mid- 
network processor by simply adding additional IPE cards. 
5 Similarlyr additional users can be supported by the mid- 
network processor 300 by adding additional line cards and 
PHY cards, and perhaps additional IPE cards to provide 
additional processing for the newly added users, if 
necessary, 

10 The flash memory cards are provided for storing 

configuration data used by the IPE cards during system 
initialization. 

Note that, as used herein, the term "packet" refers 
to any type of packet that enters or exits the mid- 
15 network processor 300, including packets input to the 
mid-network processor 300 in the form of cells (such as 
ATM cells) via an interleaved or non-interleaved cell 
stream. 

In general,, each line card used in the preferred 
20 mid-network processor 300 performs a number of functions. 
Initially, the line card converts packets (possibly of 
varying lengths) delivered thereto into fixed length 
cells. In this preferred embodiment, each line card 
converts input packets (including packets represented by 
25 individual cells) into 64 byte cells. The line card then 
examines the stream of fixed length cells "on the fly" to 
obtain important control information, including the 
protocol encapsulation sequence for each packet and those 
portions of the packet which should be captured for 
30 processing. This control information is then used on the 
line card to reassemble the packet, and to format the 
reassembled packet into one of a limited number of 
protocol types that are supported by the IPE cards. 
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Thus^ while any given line card can be configured to 
support packets having a niamber of protocol layers and 
protocol encapsulation sequences, the line card is 
configured to convert these packets into generally non- 
5 encapsulated packets (or, stated another way, into 
packets having an encapsulation sequence of one) of a 
type that is supported by each of the IPE cards. The 
line card then sends the reassembled and formatted packet 
into the switch fabric (in the form of contiguous fixed 

10 length cells) for delivery to one of the IPE cards that 
was designated by the line card for further processing 
that particular packet. 

Although the fixed length cells which comprise a 
packet are arranged back to back when the packet is 

15 delivered to the switch fabric by a line card, the cells 
may become interleaved with other cells destined for the 
same IPE card during the course of traversing the switch 
fabric. As a result, the cell stream provided by the 
switch fabric to any given IPE card may be an interleaved 

20 cell stream. Thus, the IPE card will first examine this 
cell stream "on the fly" (much like the cell stream 
examination conducted by the line cards, explained above) 
to ascertain important control information. The IPE card 
then processes this control information to perform 

25 routing look-ups and other mid--network processing 

functions for each packet delivered thereto. The control 
information is also used by the IPE card to reassemble 
each packet, and to format each packet according to the 
packet's destination interface. The IPE card then sends 

30 the reassembled and formatted packet back into the switch 
fabric in the form of contiguous fixed length cells for 
delivery to one of the line cards (or for delivery to 
another IPE card, in the case where additional mid- 
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network processing functions must be performed for the 
packet in question) . 

As noted above, although the cells of any given 
packet may enter the switch fabric in a back to back 
5 arrangement/ these cells may become interleaved with 
other cells during the course of traversing the switch 
fabric. Thus^ the stream of cells provided by the switch 
fabric to any given line card may be an interleaved cell 
stream. Accordingly, a line card will first examine this 

10 cell stream "on the fly" to ascertain important control 
information that will be used primarily to reassemble 
packets, and to format the reassembled packets for their 
destination interfaces- Additional processing of 
outbound packets is also conducted on the line card for 

15 PHY scheduling and bandwidth distribution purposes - 

While the preferred mid-network processor 300 of the 
present invention has been described as delivering 
packets from a line card to an IPE card and then back to 
a line card (or to one or more additional IPE cards) , the 

20 mid-network processor 300 can also be configured to route 
cells arriving over an ATM interface on one line card 
through the switch fabric and directly to another line 
card ATM interface , and can therefore function as an ATM 
switch. 

25 Fig- 3 illustrates an exemplary line card 380 used 

in the preferred mid-network processor 300 of the present 
invention. As s^hown therein, the line card 380 
preferably includes an ingress side (i.e., the left half 
of Fig. 3) and an egress side (i.e., the right half of 

30 Fig. 3) . When packets are provided to the ingress side 
of the line card from the line card's associated PHY 
card, the packets are first provided to a packet 
inspector chip ("PIC") 400 which converts the packets 
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(which may already be represented by individual cells 
such as ATM cells) into fixed length cells. In this 
preferred embodiment, the fixed length cells are 64 byte 
cells that are 8 bytes wide and 8 bytes long. Thus, a 
5 "cell time," in the context of cells propagating within 
the preferred mid-network processor 300, corresponds to 8 
clock cycles, as appreciated by those skilled in the art. 
The PIC 400 then examines the stream of fixed length 
cells "on the fly" to identify the "classification" (that 

10 is, the protocol encapsulation sequence), capture matrix, 
and other control information for each packet (as 
described more fully in copending Application No. 
09/494,235 filed January 30, 2000 entitled "Device and 
Method for Packet Inspection, " the disclosure of which is 

15 incorporated herein by reference) . More specifically, 
the preferred PIC 400 generates a control cell for each 
examined cell of a packet, and each control cell 
represents the control information that has been 
determined thus far for the corresponding packet- Thus, 

20 the PIC 400 outputs both the stream of fixed length cells 
that was produced before this stream was examined "on the 
fly" therein, as well as corresponding control cells. As 
shown in Fig. 3, these control and data cells are then 
provided by the PIC 400 to four preferably identical 

25 buffer access controllers ("BACs") 402-408. Each BAC 

stores a different quarter (i.e., 25%) of the data cells 
received from the PIC 400 in its corresponding cell 
buffer ("CB"). 

Each control cell output by the PIC 400 also 

30 includes a protocol processing unit ("PPO") identifier 
which identifies a PPU associated with a particular BAC 
for processing that control cell. Note that each PPU, in 
this preferred embodiment, preferably comprises two 
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general purpose central processing units ("CPUs"), as 
shown in Fig, 3. Alternatively, a PPU could comprise one 
or more network processors, digital signal processors, or 
any programmable processors. The BACs 402-408 each 
5 examine the PPO identifiers contained in the control 

cells delivered thereto over a bus by the PIC 400. When 
a BAC determines that the PPU identifier in a particular 
control cell is identifying the PPU associated with that 
BAC, the BAC will forward the control cell to its 

10 associated PPU for processing, as described more fully 
below. Thus, while every BAC 402-408 in this preferred 
embodiment stores a quarter of every data cell in their 
associated cell buffers, each control cell output by the 
PIC 400 is acted on by only one BAC and its associated 

15 PPU. As a result of being so processed, the size of the 
control cell is much smaller than the typical size of a- 
packet. This can significantly increase the utilization 
of the processor by reducing the I/O bandwidth which is 
the typical limiting factor in processor use. In this 

20 preferred embodiment, all control cells corresponding to 
a specific packet (and, more generally, to a specific 
user) are processed by the same BAC PPU on the line card 
380. 

Note that the PPU assigned by the PIC 400 for any 
25 given packet is performed according to configuration and 
control information received by the PIC 400 from a master 
PPO ("MPPO") 410, and can be changed by the MPPU 410 over 
time as necessary for PPU load balancing on the line card 
380. 

30 The PIC 400 also keeps track of the available memory 

addresses in the cell buffers associated with the BACs 
using a free buffer ("FB") list 412, and also keeps track 
of where each data cell is stored in the cell buffers 
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with respect to other cells of the same packet using a 
link list 414. 

When a control cell is processed within a particular 
BAG PPU, the PPD produces a new control cell to be 

5 provided to a packet manager ("PM") 420 which is in 
communication with the PIC 400 and the BACs 402-408. 
Included in this control cell provided to the PM 420 is a 
dequeue pointer which designates the location of the 
first cell of a packet that is to be dequeued and sent to 

10 the PM 420 along with the second and subsequent cells of 
that packet (if applicable) . The packet manager 420 then 
forwards this dequeue pointer back to the PIC 400, which^ 
in turn^ provides instructions to the BACs 402-408 to 
dequeue each quarter cell of the designed packet in 

15 sequence using the information previously stored by the 
PIC 400 in the link list 414. Thus, the designed packet 
is reassembled as it is dequeued and delivered to the 
packet manager 420. 

At this point in the processing, the packet manager 

20 420 stores the cells of the reassembled packet in its own 
cell buffer 422 (using a free buffer list 424 and link 
list 426) . The packet laanager 420 processes the control 
information it received for that packet from one of the 
BAG PPUs and then formats the packet according to this 

25 control information by modifying or augmenting the packet 
header as the cells of the packet are dequeued from the 
cell buffer 422. This process and additional details of 
the preferred packet manager 420 are described more fully 
in copending Application No. 09/494,236 filed January 30, 

30 2000 entitled "Device and Method for Packet Formatting," 
the disclosure of which is incorporated herein by 
reference. The packet manager 420 also appends a header 
to each of the 64 byte cells that constitute the 
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reassembled and formatted packet, and these headers will 
be used by the switch fabric for routing the cells 
therethrough. The packet manager 420 then forwards the 
cells of the packet in sequence to a UDASL 430, which is 
5 provided for managing cell traffic into and out of the 
switch fabric for the line card 380. Typically, the 
UDASL 430 then forwards the packet cells into the switch 
fabric for delivery to an IPE card that will perform mid- 
network processing functions for the packet in question. 

10 This IPE card is preferably designated by the BAG PPU 
that prepared and forwarded control information to the 
packet manager 420. 

Also shown in Fig. 3 is a 9-port Ethernet switch 450 
which provides for interprocessor communications between 

15 the eight PPUs on the line card 380 (i.e., 4 PPUs on the 
ingress side and 4 PPUs on the egress side) and the MPPU 
410 for purposes of load balancing, hardware monitoring 
and bandwidth distribution, and for sharing user and 
configuration information. The bandwidth distribution 

20 process and the preferred hardware are described more 
fully in copending Application No. 09/515,028 filed 
February 29, 2000 entitled "Method and Device for 
Distributing Bandwidth," the disclosure of which is 
incorporated herein by reference. 

25 Fig. 4 illustrates an exemplary IPE card 500 used in 

the. preferred mid-network processor 300 of the present 
invention. The hardware layout of the IPE card 500 is 
similar to the hardware layout on the ingress side (and 
the egress side) of the line card 380 shown in Fig. 3. 

30 That is, the IPE card 500 is also provided with a UDASL 

501 that delivers a typically interleaved cell stream 
received from the switch fabric to a PIC 502. The PIC 

502 is in communication with four BACs 504-510 that are 
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in communication with a PM 512. Thus, the primary 
difference between the preferred IPE card 500 and either 
side of the preferred line card 380 is the processing 
that is performed therein, even though this processing is 
5 performed with similar hardware. It should thus be 
apparent that the present invention provides, amongst 
other things, an inventive hardware module that can be 
programmed to perform requisite processing either on the 
ingress side or the egress side of a line card, or on an 

10 IPE card. This contributes to the configurability and" 
scalability of the preferred mid-network processor 300, 
which can be reconfigured as necessary (both through 
programming and/or by adding additional lines cards 
and/or IPE cards) to accommodate additional users and/or 

15 to provide additional processing power. 

Much like the PIC 400 resident on the ingress side 
of the preferred line card 380, the PIC 502 provided on 
the preferred IPE card 500 is used to inspect the stream 
of fixed length cells provided thereto by the switch 

20 fabric "on the fly" to ascertain control information for 
each packet to be processed on the IPE card- In most 
cases, this control information was added to the packet 
by the PM 420 on the ingress side of the line card that 
forwarded the packet to this particular IPE card- The 

25 PIC 502 outputs the stream of data cells to the four BACs 
504-510, each of which is configured to store a different 
quarter of each data cell in its corresponding cell 
buffer (note that each BAG on the preferred IPE card 500 
has two PPUs associated therewith, whereas only one PPU 

30 is associated with each BAG on the preferred line card 

380) . The PIC 502 also outputs control cells to the BACs 
504-510, where each control cell contains a PPU 
identifier that designates one of the two PPOs associated 


wo 01767694 PCT/USOl/01003 

18 

with a particular BAC for processing that control cell on 
the IPE card to perform mid-network processing functions 
for the corresponding packet. In this preferred 
.embodiment, all control cells corresponding to a specific 
5 packet {and, more generally, to a specific user) are 
processed by the same BAC PPU on the IPE card 500. 

For any given packet, the PPU that processed control 
information for that packet on the ingress side of the 
line card is also responsible for determining to which 

10 IPE card and, more specifically, to which PPU on a 
particular IPE card, the packet should be sent for 
further processing. 

After a BAC PPO on the IPE card processes the 
control information for a particular packet, the PPU 

15 sends a control cell back to the PM 512, which then 

cooperates with the PIC 502 to dequeue the quarter cells 
of that packet in sequence from the cell buffers 
associated with the BACs 504-510. Upon receiving the 
constituent cells of a reassembled packet and storing 

20 these cells in its own cell buffer 514 (using a link list 
516 and a free buffer list 518), the PM 512 processes the 
control cell received from the BAC PPU to format the 
reassembled packet according to its destination interface 
before forwarding the reassembled formatted packet back 

25 into the switch fabric for delivery to its destination 
line card (or another IPE card, in the case where 
additional processing of the packet is required) . 

Also shown in Fig. 4 is a 9-port Ethernet switch 550 
which, like the Ethernet switch provided on the preferred 

30 line card 380, provides for interprocessor communications 
between the eight PPUs and an MPPU 530 on the IPE card 
500 for purposes of load balancing, hardware monitoring 


wo 01/67694 PCTAJSOl/01003 

19 

and bandwidth distribution, and for sharing user and 
configuration information. 

Referring again to Fig. 3, it can be seen that the 
egress side of the exemplary line card 380 is also 
5 provided with a PIC 600, four BACs 602-608, and a PM 610. 
Upon receiving a possibly interleaved stream of fixed 
length cells from the switch fabric via the UDASL 430, 
the PIC 600 examines this cell stream "on the fly" to 
ascertain control information (including control 

10 information that may have been added to the packet header 
by the PM 512 on an exemplary IPE card 500) . The PIC 600 
then forwards the data cells to the BACs 602-608 for 
storage in their corresponding cell buffers, and forwards 
corresponding control cells for each packet to one of the 

15 BAC PPUs (typically assigned by an IPE card BAC PPU that 
previously processed control information for the same 
packet) for further processing. The assigned BAC PPO 
then performs additional packet processing, primarily for 
traffic shaping, PHY card scheduling and bandwidth 

20 distribution on that PHY card- This process and the 

preferred hardware are described more fully in copending 
Application No. 09/511,059 filed February 23, 2000 
entitled Method and Device for Data Traffic Shaping," 
the disclosure of which is incorporated herein by 

25 reference. Upon processing the control information 
received from the PIC 600, this BAC PPU produces and 
forwards a control cell to the packet manager 610, which, 
in turn, dequeues the qiaarter cells of the corresponding 
packet in sequence from the cell buffers associated with 

30 the BACs 602-608 in cooperation with the PIC 600. The PM 
610 then stores the constituent cells of the reassembled 
packet in its own cell buffer 612 (using a link list 614 
and a free buffer list 616) , and formats the packet for 
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its intended destination before forwarding the 
reassembled formatted packet to the PHY card associated 
with this line card for outputting the packet from the 
mid-network processor 300. 
5 A description of one preferred implementation of the 

broadband mid-network server described above will now be 
provided, wherein the following terms have the following 
meanings : 

Cardid: An 8 bit number that uniquely identifies an IPE 

10 or Line Card in the system. 

Flowld: A 10 bit ntamber whose lower (least significant) 
8 bits contain a Cardid, and whose upper (most 
significant) 2 bits identify the priority (class) of the 
traffic sent through the switch fabric to this card using 

15 this Flowld. (In the switch fabric, this field is 12 

bits, but our implementation only uses the least 
significant 10 bits.) 

C7ser: A datalink (layer 2) interface. Examples include 
ATM virtual circuits, PPP sessions (over SONET, Ethernet, 

20 or ATM) , and MPLS label switched paths • 

OserJd; A 32-bit value that can be used as a system-wide 
pointer to user configuration and state information. 
Since multiple cards (one or more IPEs and one Line Card) 
can store information about a user, it is possible to 

25 have multiple User Ids that refer to a single user. The 

upper (most significant) 8 bits of the value represent 
the CardId of the card which contains the user 
information being identified. The next 4 bits represent 
the PPDID of the PPD on the card where the information is 

30 stored, and the lower (least significant) 20 bits 

represent the CID assigned by that card to the user. The 
CID is used as an index into the PPD's table of user 
information . 

LCUserld: A Oserld in which the Cardid identifies a Line 
35 Card. 
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Primary Userld: A Oserld in which the Cardid and PPUID 
identify the PPD on an IPE with has the primary 
responsibility for managing a user. 

Secondary Userld: A Oserld in which the Cardid and PPOID 
5 identify an IPE PPU other than the one identified by the 

Primary Userld 

Small User: A user whose ingress packet stream is 
processed entirely by a single IPE PPU. Small users do 
not have Secondary Userlds. 
10 Large User: A user whose configured bandwidth is too 

high for his ingress packet stream to be processed by a 
single IPE PPU. All large users have one or more 
Secondary Userlds, 

Logical Link: A group of users of the same type (i.e.: a 
15 group of ATM Virtual Circuits) . If the Logical Link is a 

group of PPPoE sessions over ATM, the Logical Link must 
be an ATM Virtual Circuit. 

CSIX Header: The header of a CSIX (i.e., Common Switch 
Interface) cell. The CSIX Header is separate from the 64 
20 byte cell payload. 

Ceil Header: The first two bytes of the 64 byte payload 
of a CSIX cell. 

PIE Header: The 6 bytes immediately following the 
Cell Header of the first cell of a packet. 

25 Overview : 

In this particular implementation, the server system 
preferably comprises one or more rack mountable system 
units (i.e., shelves). The system also contains at least 
one line card, exactly as many PHY cards as line cards, 

30 and at least as many IPE cards as line cards. Also, each 
shelf of the system contains preferably three switch 
fabric cards and two flash disk cards. Each line card is 
uniquely associated with a particular PHY card. However, 
there is no particular association between line cards and 

35 IPE cards. 
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Each IPE card can be thought of as an independent router, 
with one or more IP addresses associated with it. Bach Layer 
2 (datalink) interface (referred to as a "user") provided by a 
line card is associated with exactly one IPE card (more 
5 specifically^ exactly one PPU on one IPE card) . Different 

users from the same line card can be associated with different 
PPUs on different IPE cards, and a particular PPU can have 
users from multiple line cards. 

Since it is possible for multiple Layer 2 protocols to be 

10 encapsulated within each other (for example, 

PPP/Ethernet/ATM) , there is an exception to the ^'one user, one 
PPU" rule. In this case, the inner-most levels of 
encapsulation, each of which being layer 2 interfaces (users) 
in their own right, can be associated with different PPUs 

15 within an IPE card, or even PPUs on different IPE cards, thus 
causing traffic from the outer levels of encapsulation to be 
split among multiple PPUs or IPE cards. It is also possible 
for outer layers to be encapsulated layer 3 traffic as well as 
layer 2 traffic (for example, an Ethernet /ATM virtual circuit 

20 can carry IP as well as PPPoE packets) . In this case, all the 
layer 3 traffic will be associated with a single PPU (a user) , 
but the encapsulated layer 2 datalinks (users) can each be 
associated with a different IPE card. 

The set of all users on the system is preferably 

25 distributed as evenly as possible across all the IPE cards in 
the system. Within an IPE, the MPPU stores the per-user 
information for the users assigned to that IPE and distributes 
those users across its PPUs. Each PPU stores a copy of the 
per-user information assigned to it. Thus each user is 

30 associated with one and only one IPE card and one and only one 
PPU on that IPE. This PPU' s copy of the user's configuration 
and state information can be uniquely identified on a system- 
wide basis by the Primary User Id. 

The architecture of this preferred implementation is 

35 based on line cards, PHY cards, a switching fabric, internet 
processing engines (IPE) and flash memory modules, as was 
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described generally above. The line cards terminate the link 
protocol and distribute the received packets based on user, 
tunnel or logical link information to a particular IPE through 
the switching fabric. The procedure of forwarding a packet to 

5 a particular IPE and PPU will be denoted as "routed 
distribution." A midplane is also used to connect the 
different cards. The preferred line card and the preferred 
IPE card were described above with reference to Figs. 3 and 4.. 
The system is comprised of a set of hardware components/ 

10 as described, which can be used to configure a system for a 
wide variety of applications as well as throughput 
requirements cost effectively. The preferred switch fabric 
and scheduler support cell switching at OC-192 speeds, and the 
switch fabric is both fully redundant and highly scalable. 

15 The preferred IPE cards have the following attributes: high 
performance protocol processing engine; manages users, tunnels 
and secure segment groups; supports policing and traffic 
shaping; implements highly sophisticated QoS with additional 
support for differentiated services; supports distributed 

20 bandwidth management processing; and supports distributed 

logical link management, able to do NAT, packet filtering and 
firewalls . 

The preferred line cards have the following attributes: 
packet lookup processing; protocol identification; scheduling; 

25 supports distributed bandwidth management processing; 

multi-I/F support (ATM, GE, POS) ; and AAL-5 Processing (CRC 
check and generation) . 

The preferred PHY cards have the following attributes: 
line termination for rates up to OC 192c; ATM Layer 

30 Processing; ATM - SONET Mapping; POS - SONET Mapping 

(including EEC checksum computation) ; GE - MAC and PHY 
Processing; and support the following line cards: ATM: 4x OC- 
48, 8x OC-12, 16x OC-3; POS: lxOC192, 4x OC-48, 16x OC-12; and 
GE: 8/lOx GE. Additionally, the overall system preferably has 

35 the following attributes: high availability; 1+1 switch 
fabric and scheduler redundancy; 1+1 control system unit 
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redundancy; all field replaceable vmits are hot-swappable; N+1 
AC power supply redundancy; and N+1 fan redundancy. 

One pujrpose of routed distribution is to forward a packet 
to a particular PPU within an IPE, The key benefits of this 

5 approach are: incremental provisioning of compute power per 
packet; allows load distribution based on the packet 
computation needs for a particular user or turmel; user and 
tunnel configuration information can be maintained by one 
single processor thus minimizing the inter-process 

10 communication needs; and allowing the portability of single 
processor application S/W onto the system.* 

Fig- 5 illustrates the distribution of packets to a 
particular IPE. A packet is received from a line card. The 
line card examines the packet and forwards the packet based on 

15 the IP source or destination address, the user session ID, or 
the tunnel ID. The IPE receives the packets and hands it over 
to the PPU specified by the line card. 

The line cards and the IPE host the flexible protocol- 
processing platform. This platform is comprised of a data path 

20 processing engine and the already mentioned protocol- 
processing unit- The separation of data path processing from 
protocol processing leads to the separation of memory and 
compute intensive applications from the flexible protocol 
processing requirements. A clearly defined interface in the 

25 form of dual-port memory modules and data structures 

containing protocol specific information allows the deployment 
of general-purpose CPU modules for supporting the ever 
changing requirements of packet forwarding based on multi- 
layer protocol layers. 

30 The protocol-processing platform can be configured for 

multiple purposes and environments. That is, it supports a 
variable nimiber of general purpose CPUs which are used in the 
context of this architecture as Protocol Processing Units 
(PPU) . One of these CPUs is denoted as the Master Protocol 

35 Processing Unit (MPPU) . 
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The data path processing unit extracts, in the packet 
inspector, all necessary information from the received packets 
or cells and passes this information on to a selected PPO via 
one of the buffer access controller devices. The cells 

5 themselves are stored in the cell buffer and linked together 
as linked lists of cells, which form a packet. Once a PPO has 
selected a packet for transmission, it passes the pointer to 
the packet and the necessary formatting and routing 
information to the data path processing unit* This enables 

10 the formatting and the segmenting of the packet- The packet 
is then forwarded either as a whole or segmented based on the 
configured interface. 

Each PPU is associated with one dual-ported memory, where 
one port is controlled by the data-path processing unit and 

15 the other by the corresponding PPU. Each dual-ported memory 
contains two ring buffers, where one ring buffer is used to 
forward protocol specific information from the data path to 
the PPU and the other is used for the other direction. The 
ring buffer for passing on protocol specific information to 

20 the PPU is called the receive buffer. The other buffer is 

called the send buffer. Two pointers are maintained for each 
ring buffer. The write pointer for the receive buffer is 
maintained by the data path processing unit while the read 
pointer is set by the PPU, The send buffer's write pointer is 

25 controlled by the PPU and the read buffer by the data path 
processing unit. 
The PHY Card ; 

The PHY card terminates the incoming transmission line. 
It also performs clock recovery and clock synthesis. Optical 

30 signals are converted into a parallel electrical signal which 
is then an input to a physical framer device which maps the 
incoming bit stream into the transmitted physical frame. 
Finally the physical layer of the corresponding link protocol 
processes the physical frames. In addition, link layer 

35 protocol processing is performed in order to provide a common 
packet interface to the line card. On the transmission side. 
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the packets or cells are mapped into physical frames. These 
frames are then encoded into the corresponding physical layer 
format and sent over the optical fiber to the receiving peer. 
The physical layer format is preferably either SONET or 
5 Gigabit Ethernet. The link layer format is preferably GE, ATM 
or PPP for POS. 
The Line Card ; 

The line card performs packet forwarding for the egress 
and ingress path. Full duplex 10 Gbit/s throughput is 

10 provided. The line card interfaces to the PHY cards and the 
switch fabric card. The Line Card is preferably configured 
for either POS-PHY or UTOPIA III interface to the PHY card. 
The Line Card preferably hosts two Protocol Internet Engine 
(PIE) chip sets. On the ingress side, one PIE chip set 

15 supports four protocol-processing units (PPU) and one MPPU. 

The Four PPOs perform routed distribution to the various IPEs 
in the system. They also provide traffic shaping and 
scheduling of flows to the switching fabric. The remaining 
MPPU is used for overall control and supports the distributed 

20 bandwidth allocation protocol of the switching fabric. 

The Packet Inspector (PI) first examines incoming cells 
or packets and the protocol information is extracted based on 
matched patterns in the data flow. This information is then 
made available to the PPU which is responsible for processing 

25 the incoming packet. Cells or packets from a PHY card are 

processed by a particular PPU based on a chosen configuration. 
This configuration depends upon the configuration of the PHY 
card itself and upon the protocol supported by the PHY card. 
The other PIE chip set, processing the egress flow, is 

30 preferably responsible for cell assembly from the switch 
fabric and packet scheduling for multiple physical ports. 
Additional support for AAL5 processing is provided for ATM 
flows. The MPPU from the ingress path is shared for 
configuration, maintenance and cell extraction of the egress 

35 flow. 
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The communication channel provides signaling and 
connection setup control for the ATM PHY card* The PHY card 
informs the Line Card about the physical layer status and 
reports alarm and error conditions. 
5 The ingress packet processing preferably involves: 

Packet Assembly for ATM traffic (AAL5 processing) ; Protocol 
Identification (Packet Data Inspection) ; Routed Distribution; 
Scheduling of traffic flows through switching fabric; Buffer 
management for ingress cell buffers; and cell scheduling for 

10 the switch fabric. 

The egress packet processing preferably involves: 
Traffic Shaping; Packet Assembly for switch fabric flow; MPHY 
Buffering; Cell Scheduling for ATM with multiple physical 
interfaces with AAL5 processing (CPCS, SAR) ; and Packet 

15 Scheduling for POS with multiple physical interfaces. 

The Internet Processing Engine (IPE) provides the 
functionality for protocol processing, user management, tunnel 
management and secure segmentation. It receives the packets 
from the switching, enforces the service level agreements 

20 (SLA's), performs packet classification, filtering and 

forwarding, and finally schedules the packet for transmission 
to the requested interface. 

The PI is part of the Packet Internet Engine (PIE) chip 
set, which consists of the Packet Inspector^ the Buffer Access 

25 Controller, and the Packet Manager. Together with the sixteen 
PPUs and the MPPU, the PIE chip set provides a powerful 
Protocol Processing xinit. The PIE chip extracts informative 
protocol information and forwards it to the PPUs and the MPPU 
based on the routed distribution decision made in the Line 

30 Cards. The chosen PPO processes this information and performs 
all necessary packet processing. This includes, besides 
forwarding and filtering, policing, and packet foimatting. 
The MPPU controls the IPE and is negotiating with the units in 
the system the bandwidth allocation of the switch fabric. It 

35 also provides bandwidth management for the configured logical 
links. The MPPU manages its connections by assigning users and 
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tunnels to individual PPOs for forwarding processing from the 
Line Card to a particular IPE. Once a connection between the 
MPPD and Line Card is set up, all packets belonging to such a 
connection are forwarded from the Line-Card to the chosen PPU. 
5 A PPU is chosen based on the already assigned 

connections, their bandwidth and the bandwidth and QoS 
required for the new connection. Connectionless traffic 
(Internet to Internet) is mapped onto an internal connection. 
If more bandwidth is needed than one PPU can manage, the 

10 packets will be distributed over multiple PPUs. 

The functionality of the IPEs include: User Management; 
Tunnel- Management; Logical Link Management; Support for Secure 
Segmentation; Policing; QoS Control with Diff Service Support; 
Buffer Management; IPv4, IPv6 Forwarding; Packet 

15 Classification; Packet Filtering with support for user 

Filters; Celox Management Database Support; Packet Formatting, 
and NAT. 

The Protocol Internet Engine Chip Set (PIE) ; 

The Protocol Internet Engine (PIE) provides the data path 
20 processing capabilities for the server system at OC-192c 
rates. The PIE chip set comprises three chips- These chips 
result in a very high performance packet processing system 
together with an interface controller and multiple general 
purpose CPUs. 

25 Each cell is preferably transferred into the buffer 

through four buffer access controllers ("BACs") in order to 
increase. the bandwidth to the PPUs and to increase the 
bandwidth to the external cell buffers. Different portions of 
the same cell are written to the cell buffers attached to the 

30 different BACs. However, the captured portion of the data is 
sent to just one of the PPUs. 

The preferred BAC unit is shown in Fig. 8. The RSU 
receives incoming data, reformats the data to an internal 
format, performs a parity check for incoming data, and also 

35 performs synchronization control. The preferred format of a 
cell received by the BAC from the packet inspector is shown in 
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Fig. 9. Referring again to Fig. 8, the Cell Filter unit 
extracts control information from the cell and sends the cell 
data to the BAU along with the indication of which portion of 
the cell has to be stored in this cell buffer. The CFO also 
5 sends the cell data stream to the PTU which translates the 
PPUID to the appropriate FPU and thence to the CCU where, 
based on the PPUID and the capture matrix, the control cell is 
extracted from the data cell CCO and stored in -the CBD. The 
CMU then transmits the control cell to the appropriate PPDs 

10 through a dual port RAM interface. 

When the packet has to be dequeued, the control cell 
corresponding to the packet is sent by the PPU which processed 
that user to the PM along with the dequeue pointer. This is 
received by the BEU of the PM, as shown in Figure 8. 

15 The control cell data stream (shown as the narrow arrow 

in Fig. 8) then goes to the ICU where it is stored while the 
DSU does deficit round robin scheduling of the data packets 
corresponding to the control packets in order to distribute 
bandwidth equitably to the BACs for sending out packets- In 

20 addition, the dequeue pointer corresponding to the packet to 
be dequeued is sent to the PID from where it is transmitted to 
the PI where it is received at the PIU and passed on to the 
BMU. In the EMU, the dequeue pointers are stored in a FIFO 
while the previous packets are being dequeued. The dequeue 

25 pointer information is passed onto the BACs cind the BAD in the 
BACs dequeues the packet and passes it through the PMU to the 
packet manager. A packet is dequeued by dequeuing all the 
cells comprising the packet which are held in the form of a 
linked list. Data packets from the data packet stream (shown 

30 as the thick arrow in Fig. 8) undergo AAL5 processing (should 
they need it) in the APU, and are stored in the IDD buffer. 
The FAU reformats packets into 64 bit slices and controls 
dequeuing from both the IDU and the DSU's DPRAM in accordance 
with the PFD. In order to ensure matching of the control 

35 packet with the data packet, a sequence number is used at the 
beginning of both the data and the control cells. Both the 
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control and data streams enter the PFO where they are 
formatted and sent to the TID to be sent to the phy cards or 
the switch fabric. 

The PIE chip set can be configured for multiple purposes 
5 and environments. That is, it supports a variable number of 
general purpose CPUs which are used in the context with the 
PIE chip set as Protocol Proc'essing Units (PPU) . One of these 
CPUs is reserved for maintenance and control purposes and is 
denoted as MPPD. 

10 The PIE chip set implements all necessary functions in 

order to hide all data path processing from the actual 
protocol processing functionality. The PIE chip set extracts 
all necessary information from the received packets or cells 
and passes this information on to a selected PPU. The cells 

15 are then stored in the cell buffer and linked together as 

linked lists of cells, which form a packet. Once the PPU has 
selected a packet for transmission, it passes the pointer to 
the packet and the necessary formatting and routing 
information to the PIE chip set. This allows formatting and 

20 segmenting of the packet. The packet is then forwarded to the 
MPHY scheduler as a whole or segmented based on the configured 
interface . 

Each PIE chip set is differently configured- The PIE chip 
set on the IPE supports as many as 8 PPUs and 1 MPPU. 4 PPUs 
25 and 1 MPPU will support the PIE chip set on the ingress side 
of the Line Card, and an equal number on the egress side of 
the Line Card. 

The characteristics of the preferred PIE are as follows: 
Three Chip Chip-Set; Full Data-path processing in hardware; 

30 Support for distributed .protocol processing by general purpose 
CPU modules; Highly scalable compute power per packet (up to 
64 PPUs can be supported) ; Flexible interface support with 
MPHY scheduling; AAL-5 Processing; SAR Sublayer: Assembly and 
Segmentation for up to 256K connections; CPCS Sublayer: CRC 32 

35 generation and check, padding control, and length field 

control; Internal Packet Processing; Checksum computation and 
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check; Length field control; Padding control; Micro- 
programmable Packet Inspection Engine; Supports any layer 
packet inspection; Supports byte matched pattern processing; 
Supports bit matched pattern processing; Results are made 
5 available to protocol processing units; Supports extraction of 
any portion of packet for protocol processing; IPv4/IPv6 
Header Checksum; Congestion Avoidance Support; EPD; PPD; 
Internal Back-pressure control; Linked List Control; Supports 
up to 8 million 64 byte cells (initially a million) ; Links 

10 cells together to form a packet; Garbage Collection; Assembly 
aging control; Buffer Access; Parity generation and check for 
signal integrity; Cyclic access for data rates up to 12Gbit/s; 
PPU Access; Dual-Port access control for up to 8 dual-port 
RAMs each with 512/256 KByte memory; Support for dual-port RAM 

15 data synchronization; Dual-ring buffer control for each dual- 
port RAM for data exchange; Threshold-based access control for 
writes to ring buffer; Support for up to 24 Gb/s throughput 
(bi-directional) ; Back-pressuring in case of buffer overflow; 
Cyclic Packet Scheduling; Packet Scheduling for cyclic access 

20 control with support for data rates up to 12Gb/s; Micro- 
programmable packet formatter; Supports insertion, removal and 
overwriting for any byte in a packet at OC-192 speeds; 
Supports IPv4/IPv6 Header Checksum generation; Support UDP/TCP 
checksiHtt generation; Cell Scheduling, Buffering and Linked 

25 List Management; Supports cell buffering for up 512K cells; 
and supports scheduling for up to 1024 queues. 

Together with the PPDs, the preferred PIE supports: 
Packet Classification ; 

Based on Layer 3,4,... Information (any layer); Packet 
30 Filtering; User programmable filters; Group filters; Firewall 

processing; Packet Forwarding; IPv4 Lookup Processing; IPv6 

Lookup Processing; Tunnel Forwarding; Buffer Management; 

Dynamic Thresholding on a per user and assigned rate basis; 

Support for up to 8 million Cell Buffer (initially a million) ; 
35 Congestion avoidance with Early Packet Discard (EPD) , Partial 

Packet Discard (PPD), Selective Packet Discard; Policing; Per 
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User and Logical Link; Enforcing traffic contracts based on 
SLA; Traffic Shaping; Per User and Logical Link; Support for 
traffic contracts based on SLAs; Support for Real-time traffic 
(low delay traffic) ; QoS Control; Supported for differentiated 
5 services; Multiple priorities per user; Flow based queuing 
(not initially supported); Bandwidth Management; Distributed 
processing for allocation of bandwidth on switch-fabric links 
including MPHY links; Distributed processing for allocation of 
bandwidth for logical link management; User Management for up 
10 to 512K users; Tunnel Management for up to 128K users; L2TP; 
IPSec; Multi Protocol Processing; and Support for any 
protocol • 

Traffic Management : 

Traffic Management for an Internet access system is 

15 complex due to the involvement of various system interfaces - 

A system might be connected to users, the Internet backbone, a' 
Local Area Network with file and Web servers, and a 
Metropolitan Area Network (MAN) which gives access to local TV 
and media servers as shown in Fig. 11. Each link has 

20 different link properties with respect to available bandwidth 
and Dollar per Megabyte. This means that a user's share of 
bandwidth on a particular link has to be based on the property 
of this link. A user might get more bandwidth share on the MAN 
link than on the backbone link due to the fact that more 

25 bandwidth at a cheaper price is available on the MAN link than 
on the backbone link. The same is true for bandwidth 
wholesaling of the preferred system to multiple ISPs who would 
like to resell bandwidth to their customers. The enabling 
technology for this model is Secure Segmentation. This model 

30 has also led to the introduction of logical link groups. A 
logical link group can be assigned to a secure segment based 
on the bandwidth needs of the considered secure segment for a 
particular link as shown in Fig. 12. This means that not only 
user allocation has to be considered but also logical link 

35 bandwidth needs to be included. Therefore, bandwidth is 

distributed based on traffic class, user, and logical link 
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group. This supports the wholesaling model and takes into 
account over-subscription requirements in order to support QoS 
including differentiated services. 

The preferred system represents a highly distributed 
5 system. In such a system, resources have to be allocated based 
on the requirements of the traffic of each component. That 
means in general that each component has to take part in a 
distributed computation method in order to allocate the 
resources. The traffic management requirements for bandwidth 

10 allocation within the preferred system will have to include 
bandwidth negotiation for the various flows through the 
switching fabric. One also has to consider the specific 
requirements to support the above-introduced concept of 
logical link groups. Since logical link groups are managed in 

15 a distributed manner, bandwidth information has to be 

exchanged between the entities managing one logical link as 
shown in Fig. 13. Buffer management and QoS Control is an 
integral part of the overall traffic management scheme 
implemented in the preferred server system. Due to the large 

20 buffer, the system has to maintain on various different places 
in the distributed system a sophisticated buffer management 
scheme which has to be implemented and supported by QoS 
control in order to support differentiated services and other 
traffic flow specific requirements 

25 Policing - Traffic Shaping : 

Policing and Traffic Shaping have closely related 
functionality. Policing ensures that the incoming stream does 
conform to the negotiated link parameters for a logical link 
group as well as the user of the incoming link. Traffic 

30 shaping enforces the link parameters for the outgoing traffic 
stream based on the outgoing user, the logical link group and 
the link itself. Fig. 14 is intended to illustrate the need 
for policing as well as traffic shaping. An incoming traffic 
stream is shaped (policed) in order to enforce the traffic 

35 contracts of a user for the considered link and logical link. 
Before the traffic is forwarded to another link, the traffic 
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contracts for this particular link have to be enforced. This 
traffic contract might be much different from the traffic 
contract of the incoming stream. Consider the case where a 
user requests information over the Internet backbone link. 
5 The bandwidth allocated on this link for this user might be 
500 Kbit/s- The logical link bandwidth for the corresponding 
secure segment might be set to 10 Mbit/s. If the user's 
access link to the system uses an ATM connection with an 
assigned rate 1 Mbit/s and no policing is enforced, the user 

10 could use the full 1 Mbit/s. This is possible since the 
traffic shaped onto the user ATM link allows the user to 
transmit the higher rate. Therefore, it is necessary to police 
the incoming traffic and the other for shaping the traffic for 
a particular link. 

15 Fig. 15 shows the schematic implementation of the policer 

and traffic shaper in an IPE within the preferred server 
system. A received cell is assigned to a particular user data 
structure assigned to the incoming link for the considered 
user. As discussed earlier, the policing information can be 

20 directly obtained from the user who is sending a packet based 
on the connection identifier, the corresponding session ID, or 
the IP source address. However, if the packet on the incoming 
connection cannot be directly associated with a user or 
logical link group, then the user and/or logical link group 

25 for whom it is destined classifies the packet. Based on the 
obtained user and logical link information the incoming 
traffic stream is policed by queuing up the packets and 
enforcing the negotiated traffic contract. 

Once the packet conforms to the incoming link 

30 requirements, the packet is shaped based on the user 

parameters and logical parameters for the outgoing link. These 
parameters are obtained from the user connection itself if a 
session ID can be associated with it. If the packet comes from 
a user and is forwarded across the Internet to a remote 

35 tearminal, then the shaping parameters are obtained from the 
sending user for the corresponding link and the associated 
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logical link group. For connectionless traffic, which cannot 
be directly associated with users, logical link group can be 
assigned based on the IP destination address and or source 
address. This allows managing traffic flows between networks. 
5 Switch Fabric Bandwidth Management and Scheduling : 

In order to meet the QoS requirements of individual 
traffic flows and to ensure that delay requirements of certain 
flows can be met, sophisticated scheduling must be conducted 
across the entire switch fabric. This scheduling takes into 

10 account the allocated user bandwidth, logical link share, 
buffer occupancy for output queues, available sub-port 
bandwidth, priority of class of traffic, and expected delay. 
All this is accomplished while maintaining high throughput 
across the switch fabric . 

15 Attached hereto as Exhibit A are details of the 

manner in which the preferred server system is programmed 
so as to minimize inter-IPE card coramunications . 

There are various changes and modifications which 
may be made to the invention, as apparent to those 

20 skilled in the art. However, such changes and 

modifications are suggested by the present disclosure, 
and the invention should therefore be limited only by the 
scope of the claims appended hereto, and their legal 
equivalents . 


25 
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EXmBFTA 

Une Cards -Ingress Side 

Line Cards do not perfonn any traffic policmg. Policmg is petformed, in distributed fashion, by the 
aU P£ catds in the systBOL If damg testing, it can be detennined that the Line Oids have ^ 
processor and I/O bandwiddi topetfonnpoUdog, this function migbt be moved to the line Caids in a 
lutuie version of the software. Also, Lmo Cards do not peifoim any rooting tabtelodcups. 

All IF padcets received, regardless of their encapsulation, must have their destination IP address 
. c^tured and examined by a LC FPU. One operation diat omst be petfonned is detennining if the 
' destination IP address is one of the IP addresses of our Systran. This am be done using a simple hash 
table. A fiinCIDRroutmg search is not necessary, since we are only loddng for an e^^ 
result offhelooki^ Of successfuQ is the Cardid of the IPE that d^ Ifanatchis 
found and die Gardid is eqoal to die Caidld of the IPB ^ Ihe pacto 

padcet must be forwarded widi die Desdnadoa FPU bit set This is so that when die packet is 
received^ die PI can select die packet to be cqitnred in its entirety (as long as it is not part of a non- 
encrypted tnnnBi). 

Additionally, if die packet is an IPsec padcet that has been recdved from a Icarge user^ and die 
destination IP address is one of die addiesses of the IPE to wMch the packet is a 
tc^ die Usedd dioald be dfilemnned b«9ed <A die IP&ec Secnii^ P^ 

d^lia^ of tesoorce and destination IP addcesses in the IP heai^ These t^Miations 

win be discussed in greato- detail in die sections diat follow. 


PIE Header 




tTO^' >^j.^jr3v ..^ Bii2 1 Bfii 


#i 


Stage 

PPU 

- 

mm 1 cm i6:i9 

^ ' 

CID (with FPU access infisnudon) 

¥• ,^ 

RayloadLeng^ 




Data 
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1.1.2 ATM Line Card 

1 .1 ^1 Small Virtual Circuits 

Tlic foUowing infonnation is sent to Ac IPE FPU along with tiie packet payload: 

• In tbe CSIX Header: 

• Destination Ho wid: , ^ ^ 

Sent in flic CSIX Header of evoy ceU of tiie padcct to identify vimere flie switch fetac 
should send it as well as flie priority (cUiss) of flie padcet 

• In Hic CeU Header: 

• Source Flowld: . 

Sent in flic CeU Header to allow the IPE to reassemble flic padcet This is snnply tne 
identification of flie line card (in flie least significant 8 bits) and flie priority (class) in flic 
most significant two bits. The priaity MUST be flw same as is specified in flie 
Destination Flowld of fliis packet 

• Discard Bit . , 

Set by flie LC PM to inform flic IPE flmt an error (IP diecksum, AALS C3tC, mtecoal 
parity) was detected in flie packet 

• EOPBit 

Set by flie LC PM to mdicate flie last cdl of flie packet 
• PIE Header: 

• InitialPID: ^ . ^ ^ ^ ^ 

Bus 3 bit fidd tdls flie IPB flie type of encapsulation fluspadcet has- The choices are: 

IPC (for inter-piocessor comnnmications), IP, PPP, Bfliemet, ATM, or MPLS 

• InitialSt^e: < ^ ^ 

This 4 bit fidd can be used to give additional information to the IPE about tte 
encapsulation of fliis packet It spedfics wfaidi stage hi flie IPE PI wffl be flie first to 
inspect flie packet 

• DestinationPPU(lbU): ^ ^ 

This bit is set for packets T^e destination address is eqM to one or flie IP add^ 

of flic IPE card fliat pack^ is being sent to. 

• Destination PPUID: 

The PPU idoitifier of flie IPE PPU fliat flie padcet is being sent to. 

• IPECQD: 

An mdex into flie connection tdile of flie IPE PPU fliat flie padcet is bdng s^ 

The PI uses flie VPWa and PHYID to calcuhte flie LC Cro, whidi is used by flie PI as m 

flie hardware connection table. The PI reads (amongst oflier flimgs) a LC PPUID whidi selects flie 

LC PPU fliat flie control mformation for flie padcet should be sent to. 

The LC OD is also used by flie LC PPU as an index into a software connection table. Typically 
(fliough not always), fliis connection table is used to determine flie Userld (which consists of a 
Destin^on Cardid, Destination PPUm. and IPE CTO) tiiat is sent to flie IPE m 
padcet As packets arc mspected by flic LC, a defcermmation of priority (one of four classes) is 
based on flie protocols found m flie padat Alternatively, flie priority could be read from flie 
connection table. This priority is used to detemime flie two most sigmficant Ints of flie Destination 
Flowld when flie packet is fiirwarded to an IPE. 

The ATM cell headers and flie AALS trailer and padding arc removed (by flie PM) before forwarding 


The following is a list of all flie different types of top level protocol encapsulation fliat can be reoewcd 
by flic ATM line card, along wifli an eaqilanation of flie processmg that must take place on flie Une 
card for each type of protocol: 
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1.1^1.1 IP (RFC.1483) 

The IP packet is fofwaided to the IPE. The IPE CID is detennined by reading die software 
connection table. 

1.1^1 Ethernet (RFC-1483) 

Each ATM LC must have a standard globally imique Ediemet MAC address permanently assigned to 
it EachEtfaernet/ATM VCshouUbeoMifigaiableastowhedierornotitisin*^ 
that is, whether or not it should discard unicast packets not sent to hs MAC address. 

AU Ethernet packets aie forwarded to the EPE widi their MAC headers intact, except for PPPoE 
session packets (ediertype- 0x8864). For non-PPPoE session packets, the IPE OD is detenniiied by 
reading the software connection table, and the Initial PID is set to indicate an Ediemet packet 

For PPPoE session packets, the PPPoE header is removed, and die PPPoE Session ID (fixHn die 
PPPoE header) is used to index into a PPPoE session table, from wfaidi the IPE CID can be retrieved. 
In tins case die ImM PID is set to mdicate a PPP packet Additionally, if die PPP/PPPoE protocol 
type is P, die PPP header is also removed bef(»e the packet is forwarded to die IPE, and tbe Initial 
PK) is set to indicate an IP packet 

i.lJLI^ PPP 

If the PPP protocol tyjpt is IP, die PPP header is removed, and die Initial PIDS is set to mdicate IP, 
otherwise, die PPP header is kept, and die hulial PIDS is set to mdicate PPP. The IPE CID is 
deterniined by reading the software connection table. 

1.1^1^ ItflPtS 

The top of sta^ dum Ubd (in die AALS FDU) is rqtlaced widi die VPyVd of die virtual drcuit 
The W WO can be deduced from die IX: CID. The IPE CID is detecmm^ 
connection table. 

1.1^ Large Virtual Circuits 

Wifli die exception of die foUowing changes, large virtual dicuits are handled in die same way as 
small virtual circuits. 

The PI DFU control registers can be programmed (by die MPPU)widi die of iq[> to 4 large 

virtual circuits. Fordiesedrcui&,ifdiepadQetamtainsanIPheadar,diePIDFU^rq)laceflieI^ 
PPUID read £com the hardware connection table widi a LC PPUID read fixun a hash table which is 
indexed by a hash of die source and destmation IP addresses of the padcet (caknlatBd fay die PI DFU). 

Any of die entries (ciiGuits) in die software VC counection table can be mad 
multiple IPE PPUs. Tlieseareknownas Jbiseitfer?, and need not be die sarne virtual drc^ 
distributed by die DFU as explained above* For diesedrcnits, if die padoet contains an IP header, a 
new hash is calculated over die source and destinadcRi IP addresses of die padcet and used to select 
one of several Userlds (Destination Caidid, Destination PPUID, and IPE QD) diat are sent to die IPE 
in the PIE Header of the packet Hie most significant bits of die Destination Flowld are still used to 
select the priority (class) of die packet However, in die case ofanlPsec packet addressed to ^ IPE 
card diat die padwt wiU be fidrwarded to, Ae C/ser/J is selected usiqg a di 
hi die IPsec protocol processing section below. 


1.1.3 POS Una Card 

The foUowmg information is sent to die IPE FPU along widi die packet paybad: 
• ini^CSDC Header: 
• Destination Rowld: 


wo 01/67694 PCT/USOl/01003 


39 

Sent in the CSiX Header of every cell of the packet to identify where die switch fibric 
should send it as well as die priority (class) of die packet 
• IntticCeU Header: 

• Source Flowld: 

Sent in die Cell Header to allow die IP£ to reassemble die packet This is singly die 
identification of die line card (in die least significant 8 bits) and the priority (class) in die 
most significant two bits. The priority MUST be die same as is specified in die 
Destination Flowld of this packet 
« Discard Bit 

Set by die LC PM to inform die EPE diat an error (IP diecksum, internal parity) was 
detected in the packet 

• EOPBit 

Set by die LC PM to indicate die last cell of die packet 
• hiibc PIE Header: 

• Initial PID: 

This 3 bit field tells die IPE the type of encapsulation diis padcet has. The dioices are: 
IPC IP, PPP. or MPLS 

• Initial Stage: 

This 4 bit field can be used to give additional information to die IPE about die 
enci^)Sulation of this packet It sped&s whidi stage in the IP£ PI will be die first to 
inspect die packet 

• Destination PPU (1 bit): 

This bit is set for IP padcets whose destination address is equal to one of die IP addresses 

of tfaelFEcarddiatpadcetisbemg sentto. 

• Destination PPUID: 

The PPU identifier of die PE PPU that die packet is being sent to. 

• IPEQD: 

An index mto die connection table of die IPE PPU diat die padcet is being saA to. 


Unless MPLS/PPP/SONET is bemgusei cadi PPP/SONBT PHY conqm^ When 
MPLS is m use, however, each MPLS Labd Swildied Padi (LSF) iq«eseats an additional user. 

For POS Line Cards, die LC OD is sm^ly die PHYID. The H DFU ccntrol r^ers can be 
programmed (by die MPPU)widi die LCOD's of up to 4 PHYs. For diese PHYs, if die packet 
contains an IP header, die PI DFU will iq)lacc die LC PPUID read from die hardware connection 
table widi a LC PPUID read fiom a hash table wUdi is mdexed by a hash of die source and 
destinationIPaddressesofdiepadcet(calculatBdbydicPIDFU). This capabiB^ofdie PI DFU must 
be nsed for (X>192c and OC-48c PHYs in order to distribute die load over mdtipteLCPPU^ For 
0C-12C and smaller PHYs, die PI DFU need not be used. Instead, d» PI uses die LC OD as mdex 
mto the hardware connection table. Ih^e PI reads (amongst odier dmigs) a IX: PPUID wUA sd 
die LC PPU diat die control mfbrmation fior die packet dio^ 

As packets are inspected by die lA a determination of priori^ (one of four cla^ This 
priority is used to determine die two most significant bits of die Destination Flowld when die padcet 
is forwarded to an IPE. 

The following is a list of all die different types of tsxg level protocol encapsulation diat can be received 
ly die POS line card, along widi an eiylanation of die processing diflt must ta^ 
for each type of protocol: 

1.1^.1 PPP Control Protocol (LCP, PAP, CHAP, IPCP, MPtSCP, etc.) 

This category indndes not only PPP Ckintrol Protocols, but also any Netwodc Protocol odicr than IP 
or MPLS. 
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The IX: PPU uses tiu5 IX CD3 (whidi is really just the PHYID) as an m 

This table provides die Primary Userld. which detennines where the pack^ is sent as well as the IPE 
QD that is sent in the PfE Header of the packet For large and small PPP/SONET ttf en, all non-IP 
and non-MPLS packets are sent to die IPE PPU identified by die Primary Userld. No distribution is 
performed for these packets. Also, die Initial PID is set to indicate a PPP packet 

IP 

The LC PPU uses die LC CID (wfaidi is really just the PHYID) to index into and lead fiom die 
software PHY table. From this die LC detennines whediCT this is a small user or a large user. For 
smaU users^ Ike Primary Userld is dlso read Gxmikt¥HY^^ This detennines where die packet is 
sent as weU as die IPE that is sent in die P/E /feoder of die packet 

For large users^ however, a hash is calculated over die source and destination IP addresses of die 
packet and used to select eidicr the Primary UserlD or one of several Secondary VserlDs. The 
selected UserlD detennines vAim die packet is sent as weU as die IP£ CID diat is sent in die PIE 
Header of die packet 

The PPP header is removed before die packet is forwarded to die IPE, and die Initial PID is set to 
indicate an IP packet 

i.1.3.3 MPLS 

As is die case widiIP/PPP/S01^» die LC CID only identifies th^ Therefore^ when die LC 

PI i<^pnt^gg an MPLS padcet; die top of stack label most be c^tnred in order to idoitify die user. 
For each POS PHY, die IX: PPU mnst maintain a table of MPLS LSI^. The LC OD selects which 
table, and die top of stack label is used to index into die table. For smoff users, die Primary UserlD 
that corresponds to die LSP can tiienbe read die table. For large users^ however, a smrilar process to 
die one described above for IP is used. A hash is calcuhted over die source and destmation IP 
addresses of die packet and used to select eidier die Primary UserlD or one of seveial Aeoondorv 
UserlDs. The selected 6!ser//) detennines iRdiere die padcet is sent as weB as dw 
in die PZE //eoJer of the packet 

The PPP header is removed before die padcet is forwarded to the IPE, and die Initial PID is set to 
mdicate an MPLS packet 


1^^4 BhemetUMCant 

Tbe folkwii^ infonnation is sent to die IPE PPU along wifo the packet pay^^ 

• InibRCSDC Header: 

• Destination Flowld: 

Sent in die CSIX Header of every cell of die padcet to identify vdiere die switdi fisbric 
^bould send it as wdl as the prioiily (class) of die packet 

• hifbcC^ Header: 

• Source Flowld: 

Sent in die Cell Header to allow die IPE to reassernble die packet This is simply the 
idoitificadon of d»e Ime caid (in die least significant 8 bits) and die prioiily (class) in die 
most aer^fi«»nt two bits. The piiority MUST be the same as is specified in the 
Destination Flowld of diis padoet 

• Discard Bit: 

Set by the LC PM to infonn die IPE diat an enor (IP checksum, internal parity) was 
detected in the padcet 

• EOF Bit 

Set by the LC PM to indicate die hist cell of die packet 

• luAt PIE Header: 

• Initial PID: 


wo 01/67694 PCT/USOl/01003 


41 

This 3 bit field tells &e IPE the type of encapsulatioa Ibis packet has. The choices are: 
IPC, EfhcTOrt, PPP, BP, or MPLS. 

• Initial Stage: 

This 4 bit field can be used to give additional infonnation to die IPB about die 
encapsulation of this packet It specifies which stage in Hie IPE PI will be die first to 
inspect the packet 

• Destination PPU (1 bit): 

This bit is set for IP packets whose destination address is equal to one of the IP addresses 
of the IPE card diat padcet is bemg sent to. 

• Destination PPUID: 

The PPU identifier of the IPB PPU that die packet is beii^ sent to. 

• IPECID: 

An index into the connection table of the IPE PPU diat die padcet is being sent to. 

ljUess MPLS or PPPoE is bdng used over die Ediemet, eadi PHY comprises a single user. Wh^ 
MPLS or PPPoB is in use» however, each MPLS Label Swifdied Padi (LSP) or PPPoE session 
represents an additional user. 

For Ediocnet Line Cards, the LC OD is simply die PHYID. The PI DFU control registers can be 
prqgismmed (by die MPPU) wrdi die LC ODD^s of iq> to 4 PHYs. For these PHYs, if die packet 
contains an IP header, die PI DFU will iqilace the LC PPUID read fiom die hardware connection 
table widi a LC PPUID read fcom a hash table which is indexed by a hash of die somoe and 
destinationIPaddressesofdiepadcet(calcalatedbydiePIDFU). This capability of die PI DFU mnst 
be used for 10 Gigabit Ediecnet Cards in aidct to distribute die load ov^ muIt^>le LC PPUs. For 1 
Gigabit and smaller PHYs, die PI DFU need not be used. Instead, die PI uses die LC CID as index 
into die baidware cooDsction table. The H reads (anicn^ odier dungs) a LC PPUID ^diidi selects 
die IX! PPU that the control infimnaticm for die padcet should be 

Eadi Bdiemet PHY must have a globally unique Ethernet MAC address pennanent^ assigned to it 

AU Ediemet padtsts are forwarded to the IPE widi dieir MAC headers into 

FIDS set to indicate Ediemet, except for MPLS and PPPoB session packets (edier^pe -0x8864). 

As padcets are inspected by die LC, a detemoination of priority (one of four classes) is made. This 
priority is used to detemoine die two most significant Ints of die Destination Flowld when die padcet 
is forwarded to an IPE 

The following is a tist of aU die difiEisrent types of top levd protocol encapsulation diat can be received 
by the Ediemet line cacd, along widi an eiqilanation of die processing diat mnst take place on die line 
card for each type of protocol: 


Tlie LC PPU uses die LC CID (whkh is really just die PHYID) to index mfeo and read fiom die 
software PHY table. From diis the LC detennines whcdier diis is a ^imzD u^er or a large user. For 
smaU users^^e Primary Userld is also itad horn This determines where die packet is 

sent as wen as die IPE Cn> diat is sent in die /VE/feoJer of die padoet 

For large users^ however, a hash is calculated over the source and destination IP addresses of die 
packet and used to select eidier die Primary UserlD or one of several Secondary UserlDs. The 
sdected UserlD determines whoe die padcet is sent as well as die IPE CID diat is sent m the PIE 
ffeoder of die packet 


The padcet is forwarded to die IPE widi die Edieoiet MAC header mtact, and 
indicate an Ediemet padxt 
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1.1,4^ PPPoE Session . . ^™ o - «x 

For PPPoE Session packets, Ac Ethernet and PPPoE headers are removed, and the PPPoE Session W 
(fiom Ae PPPoE header) is used to index into a PPPoE session table, from which die Userld (IPE 
Cardid, IPE PPUID and BPE CID) can be retrieved. A unique PPPoE Session table can be maintained 
for each PHY, and die LC CID can be used to select which session table to use. 

If Ac PPP protocol type is IP, the PPP header is also removed, and tfic Initial PIDSis set to indicate 
IP, odiervvise, the PPP header is kept, and the Initial PIDS is set to indicate PPP. 

1.1 A3 MPLS ^ T/-m 

The LCCED only identifies Ac PHYIDdiat the packet was received on. Therefore, when the LC PI 

identifies an MPLS packet, die top of stack label mu^ be captured m order to identify the user. For 
each Efliemet PHY, Ac LCPPU must maintain a table of MPLS LSPs. TTie LC CID selects which 
table, and tiie top of slack label is used to index into the table. For small MPLS users, the Primary 
a^er/Ddiat corresponds to flieLSP can then be read ftc table. For large users, however, a snnflar 
process to die one described above for IP is used A hash is calculated ovct die somce and de st ina tion 
IP addresses of tiie packet and used to select either die /Yiwaiy (/jerZ^ 
UserlDs. The selected £Acr/i> dctcmMncs where Ac padat is sent as ^ 
in the P/£ //easier of die packet 

TlMS E&crnct header is removed befro Ac padcrt is forwarded to d» 
indicate an MPLS packet 

1.1A4 Other EtliefiiotlhiBtiM»ls(ARP, PPPoE Disc^^ ,„«n„„ . 

This category inchides aU BAetnet protocols (eAcrtypes) oAcr than IP, MPLS, and PPPdB Session. 

The LC PPU uses the IX: Cro (wmch is really just Ac PHYTO) as an index mto a sof^^ 
This table provides die Primary Userld, whidi detcnnines whrac Ae padcct is sent as well as Ac IPB 
CID Aat is sent in the PiE//eader of Ae packet For to:?e and swflDEAcmet users, Aese packets 
axe sent to Ae IPE PPU identified by Ae Primary Userld. No distribution is per&xmed ftr Acse 
packets. Also, the Initial PID is set to indicate an EAemet packet 

1.2 line Cards - EgrnsSde 

Linec 


JLZl ATMUneCanI 

The following infonnation is leodved fiom IPB FPU along wiA Ae padcct psQfload: 

• In Ae CSDC Header: 

• Destination Flowld: 

Sent in Ae CSVC Header of every cell of die packet to identify where Ae switch fibric 
should send it as well as the priority (class) of die padwt 

• InlSixeCell Header: 

• Source Flowld: 

Sent in Ae CelJ Header to altow Ae LC to reasseoAle Ae padoet This is simply Ac 
identification of Ae IPE card (m Ae least significant 8 bits) and Ae priori^ (class) in the 
most significant two bits. TTie priori^ MUOT be Ae same as is spedfied m die 
Destination Flowld of Ais packet 

• Discard Bit: 

Set by Ae IPB PM to infom die LC Aat an error (internal pari^) was detected in Ae 
packet 

• BOP Bit: 

Set by Ae IPB PM to indicate Ae last ceQ of Ae packet 

• liitLt PIE Header: 

• InitialPID: 
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This 3 bit field tells the LC the type of eocapsulation this packet has. The choices arc: 
IPC (for inter-piocessor commiinicatioDs), IP« Ethernet, PPP, or MPLS 

• Initial Stage: 

Always 0. 

• Destination PPU (I bit): 

Always 0. 

• Destination PPUDD: 

The PPU identifier of the LC PPU tiiat the packet is being sent to. 

• LCCID: 

This is an index into the connection table of tiie LC 

The Destination PPUID selects AeLC PPU that win process the packet The LC CD) is used by Ae 
LC PPU as an index into a software connection table. This connection table provides Ae shaping 
parameters, any additional encapsulation ftat must be added by tiie LC, fhe PHYID, and die ATM 
VPWQ for die padKt 

The priority (one of four classes) is based on die two most significant bits of die Source Flowld in the 
CeQ Header. The priority is used by die TiafiBc Sh^ and die Schedider to detenu^ 
forward die padoet to die FHY. 

Hie ATM cdl headers and the AALS trailer and padding are alw^ added (by die PM) before 
forwarding die padcet to die PHY card. 

The fidlowing is a list of all die diffoent types of t«^ level jnotocol enc^isulation diat can be received 
fiom an IPB by die AIM line card, along widi an explanation of die processing d^ 
on die line card fiar eadi type of pn^tocol: 

1«2«1«1«i IP 

The desired encapsulation for die packet can be «dicr IP/PPP/PPPoE/Ediemet/ATM. IP/PPP/ ATM or 
IP/ATM. The PPU can determine wliidi it is fixirn die connection table. If the encapsulation should 
be IP/PPP/PPPoE/Bdmnet/ATM, die ocnmecdon labk will provide the necessary information to add 
die missing headers. If die encq;»su]ation should be IP/PPP/ATM, a PPP heaOa identi^g die 
protocol as IP is added. Also, die oitxy in die connectiim table may specify diat an LLC header 
should also be added. 

1JL1.1.2 Ethernet ^« , 

Tbe common table may specify diat an LLC header sbouU be added to ^ 

Odierwise the packet is sent as is. 

1.2.1.1^ PPP ^ 

The desired encapsulation may be ddier PPP/PPPoE/Edieraet/ATM or PPP/ATM. Tlie PPU can 
determme^^iich it tsfiom die connection table. If it is PPP/ATM. die packet is sent as is, odierwise, 
die connection table will provide die necessary information to add a PFPoE header and an Ediemet 
Header. 

lAl.l^t MPLS 

As wrdi aU odier encapsulations, die pr^r VPWa is obtained from die connecti^ 

POSUneCant ^ ^ 

The following mfocmation is received fiom OPE PPU along widi die padcet payload: 

• IniiitCSIX Header: 

• DestmationFbwId: 

Sent m die CSIX Header of every cell of die packet to identify where die switdi &bric 
should send it as well as die priority (class) of the packet 
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• In the Cell Header: 

Source Flowld: 

Sent in die Cell Header to allow die LC to reassemble tbt packet This is simply the 
identification of the IPE card (in the least significant 8 bits) and the priority (class) in the 
most significant two bits. The priority MUST be the same as is specified in the 
Destination Flowld of ftis packet 

• Discard Bit 

Set by the EPE PM to inform the LC that an error (internal parity) was detected in the 
packet 

• EOF Bit 

Set by Oe IPE PM to indicate die last cell of the packet 

• Inihc PIE Header: 

• Initial PID: 

This 3 bit field tells die LC die type of encapsulation diis packet has. The choices are: 
IPC (for inter-processor commnnicaticms), P, PPP, or MPLS 

• InitialStage: 

Always 0. 

• Destination PPU (1 bit): 

Always 0* 

• DestmationPFUID: 

The FPU identifier of die LC PPU diat die packet is being sent to. 

• LCOD: 

This is an index into the oHuiection table of die LC. 

The Destination PPUID selects die LC FPU that win process die padcet The LC OD is used by die 
LC PPU as an index into a software connection tabk. This connection table provides die shaping 
parametral, anddie PHYID for die packet 

The fidlowiiig is a list of all die difiGBmit types of top level protocol oicapsulation diat can be received 
by the POS line card, along wrdi an explanation of the processing diat must take place on the line card 
for each type of protoook 

1.2J2.1 PPP 

No additional pxocessmg is needed. 

IP 

A PPP header identifying die pad^t as an IP packet is added. 

1JL2^ MPLS 

A PPP header identifying die padcet as a MPLS packet is added. 


1,23 Ethernet Line Card 

The following information is recdved from IPE PPU along widi die packet payload: 

• In die CSDC Header: 

• Destination Flowld: 

Sent in die CSIX Header of every cell of die packet to identify where die switch fi^ric 
should send it as well as die priorit/ (class) of die packet 

• Inf^CeU Header: 

• Sonrce Flowld: 

Sent in the Cell Header to allow die LC to reassemble die packet This is shnply die 
identification of the IPE card (in die least significant 8 bits) and die priority (class) in 1h.e 
most significant two bits. The priority MUST be die same as is spedfied in die 
Destination Flowld of tins packet 

• DiscardBit: 
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Set by the IPE PM to infoim the LC that an error (internal parity) was detected in the 
packet 

• EOP Bit 

Set by the IPE PM to indicate the last ceU of the packet 
• hittit PIE Header: 

• Initial PID: 

This 3 bit field tells the LC &e ^e of encapsulation Ms packet has. The choices are: 
IPC (for inter-fHTOcessor communications), Efliemet» PPP^ IP, or MPLS 

• Initial Stage: 

Always 0. 

• Destination PPU (1 bit): 

Always 0. 

• Destination PPUID: 

The PPU identifier of the LC PPU that the packet is being to. 

• LCCID: 

This is an index into the connectitm table of &e LC. 

The DestmationPPUn) selects the LC PPU that wiQ process Oepac^ The LCCID is used by Oe 
LC PPU as an index into a software connection table. This connecdon table provides Ae shs^ing 
parameters, and the PHYID for die packet 

The foUowii^ is a list of all die diffioent types of top level protocol encsqisuMon that can be received 
by die Bdieznet line card, aloiig widi an explanaticm of file procesm^ 
card for each type of protocol: 

Ethernet 

No additiCMial processing is needed. AH XP/Edieniet are sent usmg this type>)ecaiise die IPE, not the 
LC inq)lemaits ARP, and dierefore adds die Ediemet header to all IP packets before sending diem to 
dieLC. 

1.2.3i2 PPP 

The desired encapsulation is PPP/PPPoE/Ediecn^ The connection table provides the necessary 
information to add a PPPoB header and an Ediemet header. 

1JL3^ IP 

The desired enc^)Solatian is IP/PPP/PPPoE/Edieinet A PPP header mdicating an IP packet is added. 
The connection table dien provides Ae necessary inftKmafion to add a PPP6E header and an Efliemet 
header. 

lAPLS 

The connection table provides die information needed to add an Ediemet header (die destination 
MAC address is afl diat is required fiom the coimection table). 


13 IPEC&itl 

IPE In ffre^ Protocols 
All packets recdved by an IPE card fiom die Lme Cards (or fiom odier IPEs) will be of one of die 
following types. The Initial PID field in die PIE Header will identify which one of diese types each 
padoet corresponds to. Kdiere are more dian 8 sudi types, die Mtial Stage field m die Pi^/fe<ufer 
can be tised to select a different stage to begin inspection, each of which allows 8 additional protocols 
to be identified by die inWalPW field. 

The IPE QD and PPUID in die PIE Header of die received padc^ combine widi the Flowld to give 
dieUserld. Onlydieleastsignificant 18ofdie20bitsof dielPECIDareused. 
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IPC . . ^ 

These packets are used for inter-processoT conununication within the system. The PI should be 
programmed to capture these packets to a PPU (as specified in the Destmation PPU field m the PIE 
Header) in dieir entirety. 

' These packets consist of only an IP packet That is, an IP header, followed by an IP payload (>nAidi 
might inchide TCP, UDP, ICMP, etc.). This category does not inchide IP packets received over 
MPLS or over Ediemet 

These packets can come fiom eiflier a POS LC, an ATM LC, or an Edieroet LC The possible 
enc^)Sulations fliat could result in such a packet arc: IP/ATM, IP/PPP/ATNA, IPAPPP/SONBT. 
IP/PPP/PPPoE/Ediemet, and IP/PPP/PPPoE/Bdiemet^AIM. 

The IPE CID uniquely identifies die PPPoE Session ID, or die AIM Virtual Circuit diat the packet 
was received on as well as die PHY/LC diat it was recdved on. In die case of IP/PPP/SONET, die 
IPE QD wiU identify only die PHY/IX:t diat die pad»t was received 
an IP/PPP/SONET pad^ xeoeived fimn a particular PHY/LC 


PPP 

This category consists of all PPP packets received whose PPP protocol type was not IP or MPLS. 
TliescpadcetscattcomefixmaPOSIAaQATMIX^oranEdiernrt For diose PPP sessions diat 
win be tunneled using L2IP, die IPE must add a new PPP header to die IP/PPP and MPLS^PP 
packets, smce for diose protocols, die PPP header win have beat removed by the Une Card 

Tbe possible enc2^)sulations diat could result in such a paclret are: PPP/SONET, PPP/PPPidB/Bdiemet, 
PPP/ATM* and PPP/PFP«dB/Ediemel/ATM. 

The IPB CID uniquely identifies die PPPoE Session ID, or die ATM Virtual Orcuit diat die packet 
was received on as wen as die PHY/LC diat it was received on. IndiecaseofPPP/SONET,dieIPE 
CID will identify only die PHY/LC diat die packet was received cm, diat is, it will be constant for aU 
PPP/SONET padcets received fiom a particular PHY/LC. 

eiiiemet ^ ^ . ^ ^ ^^^^ 

This category consists of an Native Edianet or Ediemet/ATM packets received cxcqit for MPLS 
(edicrtype = Ox????) and PPPoE data padccts (edicrtype = 0x8864). This category also inchuks 
packets whose destinatiwi MAC address fa not equal to die MAC address of die PHY on which die 
packet was received (broadcast and msdticast packets, and unica^ 

The possible encapsulations that could result in such a packet are: ARP/Ediemet, IP/Ediemet, PPPoE 
Discoveiy/Ediemet, ARP/Ediemct/ATM. P/Eflicmet/ATM, and PPPoE Discovcry/Btficmet/ATM. 

For Ediemet/ATM, die BPE CID uniquely identifies die ATM Virtual Circuit diat die padcet was 
received on as wctt as die PHY/LC diat it was received on. In the case of Native Ediranet, die IPE 
CID win identify only die PHY/IX: diat die pac^ was received on, diat i^ 
packets received from a particular FHY/LC. 

1A1.5 iflPLS 

This category consists of packets which begin widi an MPLS label stack. These can come nom a 
POS LC, an ATM LC or an Ediemet LC 

The possible encapsulations diat oould result in sudi a padoet are: MPLS/PPP/SONET, 
MPLS/Ediemet, and MPLS/ATM. In die case of MPLS/ATM, die Line Card wiU have replaced the 
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top of stack shim label with die real label because the leal label was encoded as the ATM VPI/VQ in 
the packet received from the network. 

The following encapsulations are NOT supported: MPLS/PPP/PPPoE^themet, MPLS/PPP/ATM, 
MPLS/PPP/PPPoE/Ethemet/ATM, and MPLS/Ediemet/ATM. 

The IPE CID uniquely identifies the same as the tq) of stack incoming top of stack MPLS label, as 
well as the PHY/LC that it was received on. In the case of MPLS/ATM Ac top of stadc label has a 
one to one conespondence with the ATM Viitual Ciicait that die packet was received on. 

13.2 IPE Ingres Pmtxfcol Decoding 

The following table shows the fiist two layers of protocols diat must be identified by die PI on die IPE 
fx each packet diat passes duough it 


IP 


Ediernet 

IP 

ARP 

PPPoE Discovery 

MPLS 

IP 

PW 

Control Protocols 


13^1 IP Packets 

AH IP pad^ xeccived by die PE, whedier still encapsulated widi Ediemet, widi MPLS, or wiflioot 
enc^)Sulation, will fidl into one of two categories: tiwse for whidi die destination IP address is equal 
to one of die addresses of die IPB, and diose for whidi it isn^t hi die case of die latter, die pacJcet 
mnstbeftffwaidedordiscankdbydiePPU. But for die former, it xmist be determined whcdier or not 
die padcet can be imcess^ entirely by die PPU, or whedier it must be sent to die MPPU fra 
piocessing. If itmustbesenttodieMPPU.itinustbecapturedmitseiitiiety. 

All IP packets received^ regardless of their encapsulation, must have dim destination IP address 
captured and examined. All routing table searches are performed by die IPE cards. If diedestmation 
address is one of die system's IP addresses, but not one of die IPB card's addresses, die packet must 
be fbrwarded widi Desdnation PPU bit set 


1^.2 L2TP Tunnels 

EadiL2TF tunnel is handled entirely by a particular IPE card Each session widim die tunnd must be 
handled entirely by a particular PPU. This requirement comes primarily fiom die need to suf^ort 
sequence numbers on die data sessions: 

RF02661: ''Each peer maintains separate sequence numbers for the control connection 
and each individual data session within a tunnel " 

Therefore, large PPP users cannot be tunneled. All L2TP control padcets are forwarded to and 
processed by die MPPU of die IPE card. 

1.3.2.2.1 UTP Access Concentrator (LAC) 

Any PPP user can be selected for LZIP tunneling by die IPB MPPU. If a user is selected fcH* 
^«w#>!tfi5 ttoi the PPU rcceivmg PPP packets from diat user must encapsulate diose padoets, first 
widi an L2TP header, dien a UDP header, and finally an IP header. The IP header's destmation 
address will be diat of die configured LNS, and die source address will be one of die IP addresses of 
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IPE The resulting IP packet can thai be forwarded using the standard IP forwarding procedure to the 
appropriate Line Card for transmission. It should be evident that tunneled PPP users on diffcroit IPE 
cards will be placed in sq>arate tun&eb even if being tunneled to the same destination LNS. 

IP packets received from &c LNS will be sent by the receiving Line Card to the IPE PPU associated 
wifli Ac ingress interfece (user), Tliis PPU may well be on a different IPE card tiian the one handlmg 
thetunnel This is easily deternuned fiom the destmation IP address of die packet In this case, die 
PPU receiving the packet ftom flic line Card must forward the packet to tibe IPE card h a n d lin g the 
tunnel In addition, flic L2TP Session ID can be used to identic which PPU on fliat IPE card should 
receive flic packet (fliis PPUID must be sent m flie PIE Headar so fliat flie receiving PI will know 
which PPU should receive flie packet). This is done by always eaoodiog flic PPUID of die PPU 
iiandling a particular session m flie most significant four bits of the L2TP Session ID. 

RF0266!: "Since L2TP sessions are named by ident^iers thai have local significance 
only. Thai is, the same session win be given different Session Ids by each end of the 
session. Session ID in each message is thai of the intended recipient, not the sender, " 

Jbc PPU to vdiich the packet is sent to can in tum can de-encqisulate flie PPP padcet and forward it 
to die PPP user ideotified by flie L2IP session ID. 

1.^,2.2r^ L2TPNetvtforic Server (LNS) 

When fimctioning as an I2^S,L2TP packets received fiom flie lACwffl a 
Line Qtrd or anoflierIPE» to flie IPE handling flie tunnel This is became flie destination IP address of 
flie padcet wiU be eqoal to one offlic IP addresses offlie IPE handling flie tunnel Wifliinfliat IPE, flie 
PPU fliat should process flie L2TP session is identified using flie most signifksnt four bits of die 
L2TP Session ID. The PPU will de^ncapsnlate die PPP pactet, dien process flic PPP packet as if it 
was received from a PPP user. Fitom flns point on, the processing is fllie same as for a ^^reaT PPP 
user. 

In die oflier direction, packets whidi, whoi flieir destination IP address is looked up in the routing 
tablf , yjjyM % ^ Wiwatin n PPP uxer fliflt k assoeiaied wifli a L2TP tmmcl instead of wifli a line Card, 
must be sent to die IPE PPU handling flic PPP user. This is because of flic seqnence miniber 
requirement of L2TP mentioned above. Once received flns PPU. die packet must have a PPP header 
added, as is die case wifli a normal PPP user. Then, instead of forwar^ng die padcet to a line Card, a 
L2TP header Padded, followed by a in)P header and an IP header. The IP destmation address is diat 
ofdieLACatdieodier endof flictiumcl The resulting IP padcet can flien be forwarded using die 
stBiidaid IP forwarding procedure to flie appnvpiiatB line Card transmissioa 


IPSec Tunnels 

Each IPSec Security Association (SA) is handled entirely by a particular IPE PPU. As defined m 
RFC-2401, a Security Assodation is a unidirectional, **^mplBxr connection fliat provides security 
sovices to die tcafiOc carried by it 

1 J^^Ai Inbound IPsec proees^ng 

• Plainpadcets 

Every PPU most have a copy of die SPD for every user torn whidi it receives packets. In oflier 
words, for every UserW (Primary or Secondary) fliat pomts to a particular IPE PPU. die PPU must 
have a pointer to an SPD. If a user's traSBc is split ammgimiltqplePPUs {Lcisl large user)^ flien they 
shouki have identical SPDs configured for die usrer, and each will create its own set of Security 
Assodations finr its share of flic laer's traffic. Every packet received must be processed using die 
SPD of the user die jpadoet is received fiont 
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The SPI is Ac field m the IPSec header diat, along with the destination IP address, identifies the SA. 
Traffic fiom a small user will always be directed by recdving Line Card to a particalar PPU. This 
PPU uses the SPI to identify die S A, and thus has access to die information it needs to decapsulate die 
packet For large users, however, the Line Card must detect IPsec packets whose IP destination 
address is one of die addresses that belongs to the BPE card identified by the user's Primary Userld. 
Rafter dian select a Userld (primary or secondary) based on die hash of the source and destination IP 
addresses of die packet, the LC must use die SPI in the IPsec header to select die Userld^ and dnis die 
IP£ PPU, to send die packet to. In order to accomplish dus, die most significant 4 bits of an SPI 
always contain die PPUID identifying die PPU diat is handling die S A identified by diat SPL 

1^2.3J& Outbound IPsec processing 

Since IPSec performs tunneling at Layer 3, entire users don^t get tunneled. Rather, each packet about 
to be sent to a user is mdividually examined using die Security Policy Database (SPD) associated with 
diat itfer, firom which a pointer to a SA (in the SAD also associated widi die user) is obtained 

The difficulty widi oudxiund processing is tha^ as discussed eadier, die configuration mformation 
(and tiins die SPD) associated with the egress uteris not readifyavaiUble. Theinformatioiimnstbe 
requested fiom the PPU identified by die /Vimai3' (ZrerAf and stoced in a cache. Eadi PPU sending 
to a tiserw31 thus create its own set of Security Associations. 


UJ IPS Packet Fdrwarcffng ana Egress Processing 

The IF£ card PPUs peifomis routing table searches for all packets diat need forwarding. The global 
I^yrwazding biformation Base (FDB ) is distributed to e veiy FPU in die system, and contains IP unicast 
and multicast routing tables in a fiirm diat &cililates longest matdung prefix searches (Le.: Patricia 
tries), as well as tables requhed for MPLS label based forwarding. 

One of die results of every routing table lookiqi is the Primary Userld identifying die layer 2 interface 
1^ which the packet shoidd be transmitted. It is hiQXjrtant to note diat die Primary Userld is not the 
same as die LCUserld, and does not direcdy give die CardId of die Lme Card where die packet should 
be forwarded. Radier, die Primary Userld identifies the IPE PPU that maintains die configuration and 
state infonnationfiir die user. 

This presmts a con^Ucation because die IPE diat is trying to forward die packet needs die 
• infonnation stored on die IPE PPU idoitified by die iVvnto^ Radierdmnshnply forward die 

packet to die other IPE for ^ress processing, ^^ch would result in additional latency and switdi 
febric bandwiddi utilization for cvay packet, it sends a message to ifie PPU identified by die Primary 
Userld, requesting a copy of the user's configuration. This information is kqit in a user configuration 
cache and is used for all subsequent padcets directed to the same user. AH counters and statistics diat 
need to be maintained for each user must also be maintained for each cached user, and must also be 
periodically sent to die PPU identified by die Primary Userld 

This process makes it difficult to implement such functionality as per-user traffic shapmg in die IP£ 
PPU, because the processing would need to be distributed among a potentially ko^e number of 
processors. Therefore, traffic shaping is to be implemented stricdy on die Line Card using die egress 
PPUs. 

One of die fields diat is acquired and cadied as part of die user configuration infomoation is die 
LCUserld, This field contains die CardId of die Line Card diat the packet must be forwarded to, as 
weU as the PPUID and QD that should be sent in die PIE header of the packet to diat 
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What is claimed is; 

1. A packet processing circuit comprising: 

a packet inspector for examining a stream of cells 
to determine control information for packets represented 
thereby; 

5 at least one buffer access controller connected to 

said packet inspector for storing at least a portion of 
data cells received from said packet inspector, and for 
processing control information received from said packet 
inspector to produce additional control information; and 
10 a packet manager connected to said buffer access 

controller for receiving control information therefrom 
for use in formatting packets corresponding to said 
control information. 

2. The circuit of claim 1 wherein said packet 
manager is configured, for using the control information 
received from said buffer access controller to reassemble 
said corresponding packets. 

3. The circuit of claim 2 wherein said packet 
manager is connected to said packet inspector for 
coordinating the dequeuing of data cells representing 
said corresponding packets from said buffer access 

5 controller - 

4. The circuit of claim 1 further comprising a cell 
buffer associated with said buffer access controller for 
storing said data cells. 

5. The circuit of claim 4 further comprising at 
least one protocol processing unit associated with said 
buffer access controller for processing said control 
information received from said packet inspector. 

6. The circuit of claim 5 wherein said protocol 
processing unit comprises at least one general purpose 
processor unit. 
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7. The circuit of claim 5 further comprising an 
additional buffer access controller connected to said 
packet inspector, wherein said buffer access controllers . 
are configured for storing different portions of data 

5 cells received from said packet inspector. 

8 . The circuit of claim 7 further comprising a 
protocol processing unit associated with said additional 
buffer access controller, and wherein said buffer access 
controllers are each configured for determining whether 

5 to forward certain control information received from said 
packet inspector to its associated protocol processing 
unit for processing. 

9. The circuit of claim 8 further comprising a 
master processing unit connected to said protocol 
processing units for providing said protocol processing 
units with configuration data. 

10. The circuit of claim 9 further comprising a 
switch, wherein said master processing unit and said 
protocol processing units are interconnected to one 
another through said switch. 

11. The circuit of claim 7 wherein each buffer 
access controller has at least two protocol processing 
units associated therewith. 

12. A mid-network server comprising: 

an input for receiving a packet delivered thereto; 
a line module connected to said input for receiving 
said packet; 

5 a plurality of processing modules for performing 

mid-network processing functions; and 

a switch fabric connected to said line module and 
said processing modules for delivering packets 
therebetween, wherein said processing modules are at 
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10 least stibstantially identical to one another and 
independently programmable. 

13. The server of claim 12 further comprising an 
additional line module, wherein said line modules are at 
least substantially identical to one another and 
independently programmable. 

14. The server of claim 12 wherein said processing 
modules are each configured to support a plurality of 
packet types, and each line module is configured for 
formatting a packet into one of said types prior to 

5 sending said packet through said switch fabric to one of 
said processing modules. 

15. The server of claim 12 wherein said line module 
and said processing modules each comprise a packet 
inspector, a packet manager, and at least one buffer 
access controller . 

16. The server of claim 15 wherein said line module 
and said processing modules each comprise a plurality of 
buffer access controllers interconnected with said packet 
inspector and said packet manager. 

17. The server of claim 16 wherein each of said 
buffer access controllers have at least one protocol 
processing unit associated therewith. 

18. The server of claim 17 wherein each protocol 
processing unit is in communication with at least one 
other protocol processing unit on the same module. 

19. The server of claim 18 wherein said line module 
and said processing modules each comprise a master 
protocol processing unit for controlling the protocol 
processing units on that module. 

20. The server of claim 19 wherein said line module 
and said processing modules each comprise an Ethernet 
switch for interconnecting the master protocol processing 
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unit with said other protocol processing units on that 
5 module . 

21. A packet server comprising: 

an input for receiving a packet delivered thereto; 
a line module connected to said input for receiving 
said packet; 

5 a plurality of processing modules for performing 

packet routing functions; and 

a switch fabric connected to said line module and 
said processing modules for delivering packets 
therebetween, wherein said line module is configurable to 
10 send said packet to any one of said processing modules 
through said switch fabric, and said processing" modules 
are each configurable to perform said routing functions 
for said packet if said packet is sent thereto by said 
line module. 

22. The server of claim 21 wherein said line module 
supports a plurality of user interfaces and is configured 
to send said packet to one of said processing modules 
according to the user interface through which said packet 

5 arrives at said server. 

23. The server of claim 22 wherein each processing 
module includes a plurality of processing units, and said 
line module is configured to send said packet to one of 
said processing units of one of said processing modules 

5 according to the user interface through which said packet 
arrives at said server. 

24. The server of claim 23 wherein said processing 
modules are each configured to support a plurality of 
packet types, and said line module is configured for 
formatting said packet into one of said types prior to 

5 sending said packet through said switch fabric to one of 
said processing modules. 
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25. The server of claim 21 wherein said line module 
is connected to said input through a phy module. 

26. The server of claim 21 wherein said line modul-e 
and said processing modules each include a plurality of 
general purpose processing units. 

27. The server of claim 21 wherein said line module 
and said processing modules can be programmed to support 
any type of transmission protocol. 

28. A packet server comprising: 

a plurality of line modules for receiving packets 
delivered to said server over a physical connection; 

at least one processing module for performing packet 
5 routing functions; and 

a switch fabric connected to said line modules and 
said processing module for delivering packets 
therebetween, wherein each line module is configured to 
format a packet into one of a plurality of types prior to 
10 sending said packet through said switch fabric to said 
processing module, and said processing module is 
configured to support each of said packet types. 

29. The server of claim 28 wherein said processing 
module includes a plurality of processing units. 

30. The server of claim 29 wherein each line card 
supports a plurality of users and is configured to assign 
each user to one of the processing units of said 
processing module. 

31. The server of claim 30 wherein at least one of 
the processing units of said processing module is 
assigned to a first user supported by a first one of said 
line modules and a second user supported by a second one 

5 of said line modules. 

32. A method for processing packets within a 
server, said method comprising the steps of: 
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converting a packet input to said server into a 
stream of fixed length cells; 
5 processing said stream of fixed length cells using a 

line module to format said packet into one of a plurality 
of protocol types; and 

sending said formatted packet to a processing module 
configured to support each of said plurality of protocol 
10 types. 

33. The method of claim 32 wherein said processing 
step includes reassembling said packet. 

34. The method of claim 33 wherein said processing 
step further includes examining said cell stream to 
obtain control information for said packet. 

35. The method of claim 34 wherein said control 
information includes information identifying a particular 
processing module for further processing said packet. 

36. The method of claim 34 wherein said processing 
step further includes processing said control information 
to produce additional control information for use in 
reassembling and formatting said packet. 

37. The method of claim 36 wherein said processing 
step further includes identifying a particular processing 
module to which said packet should be sent. 

38. The method of claim 37 wherein the sending step 
includes sending said packet to said particular 
processing module identified in said processing step. 

39. The method of claim 37 wherein said identifying 
step includes identifying a particular protocol 
processing unit on said particular processing module for 
processing control information corresponding to said 

5 packet. 


wo 01/67694 


PCT/USOl/01003 


56 

40. The method of claim 33 further comprising the 
step of performing mid-network processing functions on 
said sent packet using said processing module. 

41. The method of claim 40 wherein the step of 
performing raid-network processing functions includes 
formatting said packet for its destination interface. 

42. The method of claim 41 further comprising the 
step of sending the packet foinnatted by said processing 
module to a line module corresponding to said destination 
interface. 

43. The method of claim 32 wherein said sending 
step includes sending said packet through a switch 
fabric. 

44. The method of claim 32 wherein said input 
packet is a packet represented by a plurality of fixed 
length cells. 

45. The method of claim 44 wherein the converting 
step includes modifying the length of said input cells. 

46. A method for processing packets within a 
server^ said method comprising the steps of: 

converting a packet input to said server into a 
stream of fixed length cells; 
5 processing said stream of fixed length cells using a 

line module to format said packet into one of a plurality 
of protocol types; and 

sending said formatted packet to another line module 
configured to support each of said plurality of protocol 
10 types. 

47. A method for processing packets within a 
server, said method comprising the steps of: 

converting a packet input to said server into a 
stream of fixed length cells; 
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5 processing said stream of fixed length cells using a 

line module to format said packet into one of a plurality 
of protocol types; 

sending said formatted packet to a processing module 
configured to support each of said plurality of protocol 
10 types; and 

processing said stream of fixed length cells in said 
processing module; 

reformatting said formatted packet into one of a 
plurality of protocol types; and 
15 sending said reformatted packet to another 

processing module. 
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